'lrtO?JI4B(S)l*ll»»:B S K 8 i|:SIW»IBt»-(t) !:(» P. 90 



PCTAJS97/03 313 




'03$07jjl4B(j3)Uff24# S K B 



R : 6 2 4 P. 91 




i 




FEE lECOtP sgggr 




'03*0781 4B(J!) 14*240 J$:B S K B 



11:624 P. 92 




r ? >* 



I- 1 



Pf&VlSIONALPATEMTAPf>UCATKM 



Voli mth 



Erie 



Mm 

A a 

ft. 

















MM 


BftCM- 






MEWVOAK 




USA 











1* 

* ■ 

'» » v. 
J 



ENCLOSED MPUCAT1QN PARTS Mm* at ~ Tglt 



(ntBlfOiMM) 



EM 



(pif* 53) 



[ 



ttW m*irtfc Dm— Apart Nunwr B^OMO, 



□ 



t*m* ()19) 541-8614 
dm: 7^mAT7 24. 1996 



□ 



'03$07j}14a(jl)US§24# fl$:B S K B 



R : 6 2 4 P.93 



0; 





10 



15 



20 



OMOTERS FROM PLANT PROTOPOKPRYUNOCBN OXD^I GtNg, 

FIELD OF THE INVENTION 
This invention relates to novel DN A sequences which fwn'Jwm • piMttlcfi of 
mmc ri pti on of taso ri a rrd DN A t eq u cno c* in plants, More specifically, this iiiuMiofi rebse* to 
novel promo ter * which ire naturally ts*ociated with pUat prttoporph jriaoftfl oxide* (pratox) 
coding sequence!. 

BACKGROUND OF THE INVENTION 
L 1W Protas Emjuie and tta Iawotvenetil In 



The btosyntbetic pathv/ay> which le*i Id the pTOotaBOD OtfcfaJflrophyfl od IWC attJC A 

number of Common steps. Chkmpftyilualigtehara 
phoUMynthetk oigan^ 

oxygenases, peroxidases, raid cataiases (jee, I^hrtmger, BtQGtaU&X- Worth Pobthfeers, 
New York (1975)), ind is therefore a necessary cornpuncut for aD aetobic organisms. 

The Ust common step in chlorophyll sod hcrobio«yiithc^U tfacoaklMwa of 
proioporphyrinoflen IX to protoporphyrin DC Ptettipoipbyrinogcn oxidase (referred to herein as 
"protox") is che enzyme which catalyzes this last oxidation step (Mstringe et dL. Biocho*- J. 260: 
231 (1989))* 

The protoxenzyin his been purifkde^ 
organisms including the yeast Saceharomyces ccrtvisiac (Labbe-Bois and Labbt, In BiogYmhesis 
r>f Heme «id ChloToohvll . EJi Diiley, ed. McGriw Hill: New York, pp, 23S-285 (1990)), barley 
etioplasts (Jacobs and Jacobs. BiochatL J. 244: 219 (1987)), and mouse liver (Dailey Ad Kan, 
Biochm. 26: 2697 (1987)). Gtaes encoding protox ba^ 

orgsnisms* Escherichia coU (Sasartaan *taL,C*Lj> MicrobioL 39: 1 155 (1993)) vdBaciUui 
subtilis (Dailey cteLJ. Biol CW 269: 813 (1994)). These genes share no sequence similarity; 
neither do their predicted protein products share any amino acid sequence identity. TheE. eoli 
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protein ii appro slmtfely 21 kDt, and aNociigBi wift fte cell 
51 kTX and is a aotubfc. cytoplaemic activity, 

FlXKox encoding cDNAi hm now eko been itoUied from 
X JM. Chew. 270(14): 8076-*t*0 (1995) ml pim* (laterMCkNnl 
9 PCI7IB9S/0O452 f* i line 8, 1995, published Dec. 21 , 1995 m WO 95/34659). 



II. TbePratox Geatosa HertlciifcTtryil 

The use of herbicide* to control mfctixtbfe vefttsiM 
10 hi* become aluttttmrvefialpncticn. The rekvMit msrkri exceed* i bilbon dollars ■mufly. 
Despite this extensive use, weed control ftnwni a significant md costly problem tam 

Effective u>e of bigbicktoroqutw Fof instance, time and method of 

application md stage of weed plant development arc critical to getting good weed tautrot with 
- herbicides, Sine* wincm weed spott 
15 herbicides becomes inenmngry important 

Unfortunately, herbicides due exhibit greater potency, brooder weed spectrum mdtaott 
rapid cfegrtdatiofl in soil can also hive greiter crop phyrotoxkity. One totaftoo ifpUed to du 
problem has been to develop crops which aretcsistamcf totetanttoherbfcides. Crop faybrids or 
varieties fesistam to the berbi^ 
20 ojmagetothecrop.Devetopinemtf a herbicide to a crop 

where its use wis previously precluded or limited (cj. to ptc-emergeoce use) due to se n si tivity 
of the crop to the herbicide. For example, US. Patent No. 4,761,373 to Anderson etaLl* 
directed lo plants resistant to various imidaxolinone or sulfonamide herbicides. The resistance is 
conferred by an altered aoetDoydroxy*:id synthase (AHAS) enzyme. LLS. Patent No. 4,975,374 
25 to Goodman ttal relates to plam cells and plants containing a gra 
synthetase (GS) reustant to irihibiti^ 

phosphinotbririn and methionine sulfoximine, Patent No. 5£13AS9toBttfcroolttfiaL is 
directed to plants thai express a mutant acctol*ctate synthase which renders the plants resistant to 
inhibition by sulfonylurea herbicides. U.S. Patent No. 5,162,602 to Somers ef aL discloses plants 
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UMCJ AIR IQ inDJWUOB UJ cyclCflRXMBQICiBi n SJIOn J|pnxypra^BMHC »*v m^MCl^M. IIW 

tolerance it conferred by in shared acetyl umuym A carboxyla—fAGC^). 

The protox enzyme serves at the target for a v«*y of hcrbicidal mtiyuumk The 
herbicides that inhibit protox include many different ftmctwii daaees of molecules (Dokt gt aL, 
Weed Sci 39:465(1991); NandihaOi €t aL. PesfkUt Biad^ PkyglpL 43^n (\my 9 
Muring* # aL, FEBS UtL 245: 35 ( 1 9*9); Yaneae aid Andoh, Pesticide AteW ft*** J5: 
70(1989)). Thcac hcrbicidal compounds include me dip h uiyfa l ma (eg. arifltsortw, 5^ 
chkiro^rifluoremefhy l)prcnOTy).2Hiitroc«2ok: acid; it* methyl easer; or Myfhmta. 2<Mon>» 
lH3"Cthc*y^itrc?he^ owdiatoa, HZUfcriik^ 

HlH»emy)ethc*y)pbe^ (Cf. 
S-23142, //^4<hkro>24]U0rth5-procwy 

chtoroprrthalim, ^4«chkMopbLTy1)-3.4^ phenyl pytrote* (eg. TNPP- 

ethyl, ethyl 2-[IK2A4*rkMc*oc*^ MAS 39279), 

pyridine, derivatives (c~ft. LS and pbenopylu end its CHshe^rpynoik&lo- ad 

prperidinocarbarnate enalogs. Many of theae compounds cm y i pft li t i very mhsbit the normal wa cte ai 
catalyzed by the enzyme, apparently acting as subeme analogs, 

Typically, tte inhibitory effect on protox is determined by measuring fluorescence at about 
622 u> 635 nM. after exritriootf e,g. Jacobs and Jacobs, Enyzme 2& 

206 (19S2); Shcnnan et at, Plant PhysioL 97: 280 (1991)X This assay abased on tbofwtbai 
protoporphyrin IX is a fluorescent pigment and protopeephyrinofen DC is noafluofesceaL 

The predicted mode of action of" proton-inhibiting heibtcitics involves the mtu mailman of 
pTutcpotphyhno^en DC in the chloroplasL This accumulation is thought to kad to leakage of 
pitnoporphyrinogen DC into the cytosol where ii is oxidized by a peroxidase Activity to 
protoporphyrin DC. When exposed to light protoporphyria DC can cause formation of sangjei 
oxygen in the cytosol. litis singlet oxygen can in Cum lead to the formation of other reactive 
oxygen species, which can cause lipid peroxidation and membrane disruption leading to rapid cefl 
death (Lee ei at. Piatt Physu>L 102: ggl (1993)). 

Km all protox enzymes are sensitive to herbicides which inhibit plant protox enzymes. 
Bc« of the protox enzymes encoded by ge^ 
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Can. / MknbtoL 39: 1 13J (1993)) mi BmcWm **Oit (Daiby tiwi.J.moL Omm. W. *I3 
(i994))««rfn«^tolimMikidAlMiibNn fcrfdi^ matt rfteaMkcIblv rip 

CMtBupdoiftmaf rtmhfifdtil reatatant lo the phewf tiwidc fcaririetde &23142 he*v been reported 
(Ktiaoki * 4, I PttheiOe Set !J: 449 (1990); Sttb*««£, to BotfiaaAanXMlniL 
Vd. m. N. Mrnta, ad. KhtwtrNetheriMk. pp. 567-570 (1992)). At fc* oftcof (hew owe* 
ippf in to have m attend protox Activity dial it fpejatam MX only to the herbicidel mliiWior on 
which the mutam wax eekcted, to {Q*MOCiaL,Z. 
Naturfmch 48c: 339 <t993); Sato «*L. In Af? ffYTIIffiirTt ^naTYTf p***-^ S, Dyke. 
e& ACS Prea: Wejbingfaa, D.C (1994)). AintitirtMtemccfl faehv 
iin^stim to the inhibitor S-21432 {CU&sL, ZMatofondL 4fc 350(1993). battfttot, 
isodjfied, tnhibitoi -ftiifltaat f onw 

tnteroctiawl tpfMkaftkm no. PCT/IB95A04S2 filed Jim* 8 # 19M.jnbtiihedDac.2U 1995 » WO 
95/34659- 



IIL RcgwtaHflW of Pretax Gcae ExprvaakT 

The bulk of the research related to the pretax jew whidih«beeocood»a£dthw^ 
his focused upon the coding • c qu coce and mnriifictrtow to ifan enzyme which nsy iBtfkr it 
resistant to protot inhibitors. No information u available in the m with regard to the regulatory 
elements which control and promote the expression of protox coding se qu ence* to plasm 

SUMMARY OF THE BWENTION 

The present invention is based on the discovery that the promoter region* naturally 
essociatwi with the plant ptotoporphyriDOgen oxidate (protox) coding sequences, referred to 
herein generally as the "protox promoter'' , arc uteful for promoting exprcuioe of a betcrologgiii 
coding sequence in a plant. 

In Accordance with thi* discovery, the present hmj^ 
molecule u w p r u m g g plant pretax p totoo ter . The present invention further provides a chimeric 
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gene cofnpriaing t plans putt* promoter t^ecsWy Hnaud m> a 
PUm tissue Mid plants eontttntog iddi i ehimerie |« n itoo pfwidtf. 

In one aspect of the invention the pnxox promom* is need to 
fdfini of herbicide ttrytt proteins to • plant to confer tolerance to fte herbicide, Aoconlmf to 
this aped, the precox pramoiu may be opcrnbly linfcrd lo a coding sequence for a herbicide* 
resistant plant protox proscip which ii resistant to inhibitors of wunudifkd plant 

DESCRIPTION OF THE SEQUENCE LWIWG 



10 SEQIDNo. 1: DNAc»d^sc*j«oce faran^ni*^ 
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SEQ ID No. 2: 
SEQIDNo.3: 
SEQIDNo.4: 

SEQ ID No* 5; 
SEQ ID No. 6: 
SEQIDNo.7: 
SEQ ID No. 8: 
SBQ ID No, 9: 
SEQ JD No. 10: 
SEQ ID No. It: 
SEQ ID No. 12: 
SEQ ID NO. 13: 



AroMdcprif theluma precox- 1 sninoacid 



1. 

DNA coding 



by SEQ ID No, 



for en ArabidoptU theLana procox-2 pratcm* 



Arvbidoptis thaiiana protox»2 ammo acid sequence encoded by SEQ ID 
NoJ 

DNA coding sequence for a maiae pctrtox-1 proton. 

Maize protox-1 amino acid sequence encoded by SEQ ID Na 5 

DNA coding sequence for t maize protox*2 pnneni. 

Maize protox-2 amino acid sequence encoded by SEQ ID No. 7 

DNA coding sequence fori wheal protas-1 protein. 

Wheat proton*! amino acid sequence encoded by SEQ ID No, 9* 

DNA coding sequence for a soybean protox-1 protein. 

Soybean pretax-] protein encoded by SEQ ID No. 11, 

Promoter sequence torn AwbLfopsi* ihaliwvcata*-l pnx- 



03f07fil4H(g)Hft6# 1K:B S K B 




R : 62 4 



P. 98 



r 



DEFINITIONS 

As used herein a "plant protox promoter* is tiled to refer to the regulatory region which 
naturally occurs immediately upstream of a ptocopotphyripogen oxidate (prota*) coding sequence 
in i plant and is responsible, in its oatvraliy occurring state, for regulating the trncnpckm of the 

|s Mioriatfd pretax coding sequence. The plant proiox promote tnclodes the DNA region directly 
involved in binding of RNA polymerase to initiate hanscriptkw endaddhkipal qpstieim 
teguUtory cis-clemcms which influ 

As used herein i "gene" is wed to icfcr to * DNA moleoikwhiehbid»des(t)ecodinf 
sequence and (2) associated regulatory regions which promote md regulate the uau a uipti oQ of 

10 the coding sequence in a suitable host cell The coding sequ ence may encode a oscr^ transcript 
(eg. amisense RNA) or polypeptide produced by translation of the encoded tmacnpL A gene 
includes at a minimum, in 5*« T orientation, a promoter legion, a coding sequ ence and a 
umcripti on tenninator. A gene may also include additional regulatory region* which can occur 
as part of the minima] elements (e,g. leaden or signal peptides within the coding sequence) or as 

15 discrete dements (eg. mtrons). 

As used herein a "chimeric gene * refers to a gene which does not naturally occur 
wherein at least one component part is heterologous with respect to another comp on en t part- As 
used herein Co describe the present invention a "chimeric gene" refers to a gene which includes the 
promoter of the invention operably linked to a heterologous coding sequence. 

20 As used hercift with reference to the relationship between a promocerand a coding 

sequence, the term *hetero*Og<3U$" is used to refer to i relationship which does not naturally . 
occur. For instance, a coding sequence is considered heterologous with respect to a promoter 
sequence if it is different from the coding sequence that naturaOy occurs in association with the 
poinatef sequence. This includes modified forms of coding sequences which are naturally 

25 associated with a subject promoter. Accordingly, a modified, inhibitor-resistant protox coding 
sequence is coosiderea to be heterologous with respect to the rsomoter diet is naturally 
associated with the unmodified, uihibitor-tcasitivc form of this coding sequence. 
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As used herein* the terra 'substantial icqucncc homology* is used to indies* that a 
nucleotide sequence (in the case of DNA or RNA) or m arnino arid sequence (in the cm of t 
protein or polypeptide) exhibitB substn 

nucleotide or amino arid sequence* Any functional or structural differences between sequences 
5 having substantial sequence homology will be de minimis; mat b they will not affect the ability of 
the sequence to function as in di c a ted in the present application. For example* A sequence which 
has aiibatantial sequence bor^^ 

will be able to direct the same level tnd pattern of expression of tn associated DNA sequence as 
the plant protox promoter Sequence dm have subsxantial sequence homology with the 

1 0 sequences diarloanri herein are usually variants of the disttoard sequence, such as inflati o ns , but 
may also be syndic** scqTjencea. Structural differences tncoaaifcrridciaa^ dtbazh* 
significant amount of rrrrT overlap or similarity between two or more different s equences or 
if trie differ sequence Sttchehnncteristics 
for example, rarnimotogical reactivity, enzyme activity, stnieniral protein integrity, etc 

1 5 . Two nucleotide sequences may have substantial sequence homology if the sequences 

have at least 70 percent, more preferably 80 percent and most preferably 90 percent lequcnce 
similarity between them. Two amino acid sequences have subaannsl se q u ence homology if they 
riave st least 50 percent, pttfet^ 

the active portions of the polypeptides. In the case of promoter DNA sequences, "substantial 
3D sequence homology* also refers to those fragments of a promoter DNA sequence that are able to 
operate to promote the expression of aasoristcd DNA sequence*. Such operable fragments of a 
promoter DNA sequence may be derived from the promoter DNA sequence* for example* by 
cleaving the promoter DNA sequence using restriction enzyme*, synthesizing in accordance with 
the sequence of the promoter DN A seq uence, or may be obtained through the use of PCR 
35 technology. Mullis et aL Mrth RnryttuM. ] 55*335-350 ( 1987); Erlich (ed), PCR Technology. 
Stockton Pre** (New York 1989). 

A promoter DNA sequence is Said to be "operably linked" to a second DNA sequence if 
tiie two are situated such that the promoter DNA sequence influences the transcription or 
translation of (he second DNA sequence. For example, if the second DNA sequence codes for 
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the production of a protein, the promoter DNAiwjacoct be opciiWy linked to te 
DNA sequence if (he promoter DN A s equ en ce affects the expression of the proton product from 
foe secood DNA sequence* to exampte, in a DNA sequence cor^^ 
aeq*»ce physically attached to & coding DNA sequence in the tainedihneik coMtroet, the two 
5 sequences mz likely id be opcrsbly linked. 



DETAILED DESCRIPTION OF THE INVENTION 
The present nventioti relates id promoter DNA sequences which tie 
I with coding se qu ence* for p tat protopor^ynnogen oxidase (refer red to herein it 
"proW; jm ixuauaiiorial ^plication no. PCT/IB9&0(H52 Rbd June 1. 1995, pufctisbed Dec 21. 
1995 «i WO 93/34659 and co-pending provisional application emitted " DNA Moseeuks 
Encoding Hart Pro tu purp hy mofen Oxidase end Iphfcrlcf Resistant Mtfaats Thereof fikd cm 
the uoe day as the instant applic*ion), These protox pro mo ter s cq ju rara have been found to be 
useful for the expression of a beterotofoui coding sequence in a plant 

The promoier sequel for the An»W^^ 
ID No. 1) is provided as SEQ ID No. 13. Isolation of this promoter from a fenodiic library using 
the atiociatfd coding sequence as a probe is described a Example K This tine approach can be 
used to Uolaie the promoter iapaczfrmtnyp^pKtotgte Any p«l« coding sequence 
w bich shares suff ciem homology to hybridize to die prose* coding sfqwrnce etwiurd with the 
promoter of interest may be used a* i probe in this approach. Since the respective protox-1 and 
protox*2 coding sciences from all plants are crxueaptoed to share this requisite degree of 
tomotogy.fediokeof whk* 
critical. However, for optinulty 

protox coding sequence* Most preferably, Che coding sequence used as a probe is from the same 
plant species ai die protox promoter of interest and is the coding sequence naturally associated 
with the promoter. 

The plant protox promoter of the present invention includes the Arabidapm precox* 1 
pfomotcr sequence set forth in SEQ Id No. 13 as well as corresponding protox*! promoter 
sequences ■vailabic from other plant species as indicated above. The presem invention also 
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me production of a protein* the promoter DNA se*p> ca c c would be operabty linked U> the ascend 
, DNA sequence if the promoter DNA sequence affects the expression the proiem product from 
the second DNA aeqaen ce , Fof example, in a DNA sequence comprising t pro moter DNA 
sequence physically attached to a coding DNA sequence in die same chimeric construct, the two 
5 acq ue nccs mc likely to be opcrsbfy linked 

DETAILED DESCRIPTION OF THE INVENTION 
The pretest invention rcUtc* to promoter DNA sequence* which are nmnlly 

tuoci eted with coding sequence* far p lent protopocphyrinogen oxidate (referred fc> herein as 
to, "protox"; Jet imematkmal application no. PCT/TB95XXM52 fifed June S. 1995, published Dec 21, 

1995 at WO 95/34659 and co-pending provisional appbeaiioa entitled * DNA Motaeufcs 

Eroding Plant Pro to porp hy rinogen Oxidase and lAmitor Resistant Muum Thereof" filed cw 

the wse day at the mum appbcttion). Theae prom pr omo te r ■cqu^nrafa have been found to be 

useful for the expression of a heterologous coding aequence in a plant. 
lj ^proinoiertequentt for tho Am 

ID No. 1) is provided as SEQ ID No. 13. Isolationof thii promoter from a genomic library using 

the associated cocUng sequence as a pro 1. This tame approach can be 

i uicdu> isolate the promw^ Any protox coding Sequence 

which shafts sufT dent homology to hybridize to die protox coding sequence a sa o ru a rri with the 
20 promoter of interest may be used as a probe in (his ap proa ch . Since the respective protox- 1 and 

prote**2 coding sequences from all plants are contemplated to share this requisite degree of 

homology, the choice of which pretax coding sequrAx a 

critical. However, for optimal hybridization results ii is preferable to use the most closely related 
protox coding sequence. Most preferably, the coding sequence used as a probe is from [he same 
23 plant species as the protox promoter of interest and is the coding sequence naturally as tnnairri 
with the promoter. 

The plant protox promoter of the present invention includes tfaeArgrw^prit protox*! 
promoter sequence id forth in SEQ Id No, 13 as well as corrcapoading protox- 1 promoter 
t available from other plant species as indicated abovCi The r*t*em invention also 
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cxpressxn of si operably linked coding sequence n the saint n^tnci it the exemplified protox 
promoter K<jii?noc Such functional fragment? oty be identified thfough deletion analyses or 
, other gapdardtoimiq^i^ro (tee, pages 546- 

5 S49of'tknulV\e&byLtwin,0^^ The present mverooo also 

includes DNA acrjoenc^s having substantial snjutflte homology with Ac protox profnotcn 
available frotP pUnt genet which coofcr an equivalent tevd and pattern of cxp r cssj nn upon an 
opetabfy linked sequence Suchpromccer sequences nary be 
, protca ymrMtm isolated 

10 of the plant protox promoters. 

As ittusmtfcd in the fnwijtki below, the DNA scmN3iccs» vectors ind \iwti£fi\n* plants 
of the present invention comprise a promoter scquciiccdew^ from a plant proasKgetfe Tile 
protox promoter DNA sequences are prefentri y (inked opershly to a coding DN A sequence, for 
example a DN A sequence which b transcribed into a useful RNA tmscript such as an antiseaac 

15 transcript, or « coding lo queoc c which is ultimately expressed in the production of a useful 
protein product. 

In a preferred embodiment, the protox promoter ii used to direct the expression of i 
nuoUfrt hertec^Urteteazyn^ 

c orr es ponding unmodified venioo of the enxyme. Such modified hcrbkadeMWtaotemytncs 

20 include terbicidtMtsilUMf^^ 
1 9426909 published Nov. 24* EPSP synthase («e U-S. PiL Not. 4,533,060: 4 J0JD61; 
4.940*35 and EP 550.633). gJuuraioe synthetase (GS; ate US. Patent No. 4 t 975 r 374) # acetyl 
coenzyme A cxrboxylase(AOCaie; s« LLS, Patent No. 5.161602), and acetolactste lymhase 
» <** VS. PaicrU No*. 4.76U73; 5J04.732; 5331.107; SflUjBS* 5.141,870; and 5 J7B.824). 

25 In a most preferred embodiment, the protni promoter ii used to direct the expression of a 

modified prom* enxyme which is resistant to protox inhibitors as illustrated in Examples 2-3 (at* 
alio International application no. PCT/EB95AKM52 filed June 8. 1995, published Dec. 21* 1995 as 
WO 95/34659 whote relevant parusft herein incorporated by reference; xm also co-pcoding 
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application niMWI m DNA Motrnries Fnnvliftg Plant P imuimyfaj i f u*o ajon Oxidase and Inhibit 
ftesistant Mutants TtarooT fDcd on the tameidsy m the naaaflt application). 

The trama/wif? plants of the pmasnt urvcntocc may be tiansfoijned by any naJhod of 
panafo ei nati on known m the set Thcae method* mcrode, for instance, transformation by direct 
frfcctk»Qrco-<uhivaben^ 

pal, Seine*, 223: 1229 (1985); Marto* Cutanea^ vol 
l.pp5l^521 (19M);dtf«|e«tnntfcrfl^ /2;27l7 
(1994); Loczx etaLMoi Gol A Gtml 1199:17* (1983); Froram «r aL, Mtfur* 31*119 (1986). 
uifcjy jii ij ectflc Urn J - i Hi w , Kkto *t at, BiaSTcdmeUw. £559-363 (1988); mjcctkn into 
protoplasts cnhaiBd cell* and assnc*, ftek± * «i> Kc/T*dmotogy> 4\ 100M004 (1986); or 
iipccrionrJ»iiMis«^ 

Mm**, J25474-276 (1987); Hooykaa*.V« Sioftem * at Ntfwv. J//:763-764 (1984); 
Grixufcy ef at, Bi*n*dwwto&, 6:185 (1988): and Grhmky «j < Mttm J2J:177 (1988). 

Tteffncnbonfeilkntra^ 
any icsti i Lti on lo what is descrihfd thcreui. 

EXAMPLES 



EXAMPLE 1: Iaotetkmeftb* 

A Lambda Zjp II gencmk DNA library prepsed from ArMdopm Aaliana (Columbia, 
, whole plant) was purchased from Strateger*. Approximately 1 25.000 pb*ge weicptocdMa 
density of 25,000 pfu (plaque forming uoiti) per 15 en Pttri dith ind duplicate lifts **re made 
onto Colony /Plaque Screen membrane* (NEN Dupom). The plaque hfti woe prated with die 
, Aribidnptis Prou»*1 cDNA (SEQ ID No. I Ubelod with 32P-4CTP by the random priming 
meited (Life Technologies). Hybridisation and wash conditions were at 65° C as described in 
, Church end Gilbert, Pre* Wait Acad Set USA 81: 199 1- 1995 (1984). Positively hybridizing 
, plaques were purified and m me excised into pBlucscript pUimids. Sequence from Che genomic 
DNA inserts was determined by the chain teanuubon method using didbaty terminators labeled 
with ftoaosocnf dyes (Applied Biniystems, Inc.), One done, AraFTlPm> was d efr m i intd to 
contain 580 bp of Arabidoptit sequence upstream from the initiating methionine ( ATC) of the 
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Protox- I protein coding acqoe ncc . This ckut also contains coding tec^jence and iiuroo* thnt 
extend to bp 1241 of the Pretax- 1 cDNA sequence. The 580 bp 5' noncodmg fl a gm en * is the 
pmativc Anbidopais Precox- 1 promoter, xod the sequence U set forth in SBQ ID No. 13. 
AiaJTtPro w» deposited Deoembcr M, 1 993, tt pWDC-t 1 (NRRL iB-21515) 

EXAMPLES anm ii ue tk m of ptont tr aimf o rpaa rt— vectors cipmnfag a * u i d 
PmoK*l ytm *—hir** the jnattvc AraHdopeis Fhsoqe-1 pewsitr 

A frlHengAcDNAtf the appropriate a)^^ Prowx-1 cDN A if t$almcd *i 

an EeoRI-Xbol partial digest fragment and doort nttoihepl«tex|3rt»«« vector 
pCGNt761ENX (see Eumpk 9 of bMcmtionai tppfcrioo na PCT/D9MXM52 filed June 8, 
1995.publithedDec.21, 1995 ai W09SQ46S9). TOs ptemid H digested with Ned «JB*mHI 
to produce a fragment comprised of the complete Protox- I cDNA phisaiTtfwjptko tenniniior 
from the 3* unttamlafcd sequence of the tmlgcoe of Agrefacsmm tuiu^iriau. The 
AnPTlPro plasmid de s cribed above is digested with Kcol and BarnHl to rxodnce a fngmem 
comprised of pBlucScript and the 580 bp putative Arabidopsis ProtoX-1 promoter, Ligatiouc* 
these two fragments produces a fusion of the altered protox cDNA to the native protox promoter. 
The expression cassette containing the Protox-1 promotcr/prptox-l eDNA/trol terminator fusion 
is excised by digestion with The binary ptemid 

is transformed by eict u o p oc aa on imo Agrobacterium and then into Axsbidopst* using the vacuum 
tnfiUralioa method (Bcchtold tt al CfLAcad ScL Paris 316: 1194-1199(1993)). 
Transformants expressing altered protox genes are selected on kanamycin or on various 
concentrations of protox inhibiting herbicide, 

EXAMPLE 3: ItoefectiootfbeiWddt 

r I UUlX-A pffTltTWIMT/wWCfTPD IIOfAK"! WjM 

Using the procedure described above, an Arobuiopsis Protox- 1 cDNA containing t TAC 
toATG (Tyrosine to Methionine) change at nucleotides 1306-1308 m the Ptmox-1 sequence 
(SEQ ID Nal) was fused to the native Protox- 1 pio m utcr fragment and tranxfqfmod imo 
Arcbidopxii ihaliana. This altered Protox-1 enzyme (AiaC-2Met) has been shown to be >l0fold 

u 
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r trifiuU to various protox«inhibitbig berbicidee Am I 
tested in a b a m nal cxpresnoo system (m* Rinplc S ctf cope nd i ng U.S» i 
D^Moteoto Encoding 

Tlicreof" filed on the came day it the nunc ipptictfon)* Seed from Ae 1 
was collected and plated on aracfo (10XnM-l*Oi)M) erfapfeto& talubra^ meykxtwcH I 
fbnotlaXVIL Muhipfe experiments wrft 
coutxiiif ation of thif confound it piflickut to prevent 1 
seeds expressing the AnC-2Mei tltctricnzymt fuaci tote tri*?rttt-l 
f¥^AribidopAsndliap« beAie^eoDmtmk^ 
h it her herbicide u4n ante when co m pare d to wild*type A rabidflp lta. This] 
protoi enzyme fusion therefore fanctfoos w in effective scrrttsrafv marker for ptaei 
transfcrmatiafL Sevc^ of the plana that germinated w IOOD11M1 
were transplant! 1 *! to frit, grown 2*3 weeks, tad tested in a spny auay with < 
concentrations of the protcx-tnhibitinf herbicide. When compered to empty vectm* UJUUuI 
tnnsforaantt, tte AnFTlFnVAnC-TMet transgenics were >10fold more tofcram to the 
herbicide spray . 



EXAMPLE 4: 



bobtkm ofMmbe Prota-l 



30 



A may* (Missouri 17 inbred etkrfatedrcod^^ 
FIX H vector was purchased from Scratagene, Approximately 250,000 pfiiof the library wm 
plated si a denary of 50,000 phage per 15 cm piatt *ad duplicate lifts were made onto 
Colony/Plaque screen membranes (NEN Dttponi)The plaque lifts were probed with the mite 
Protr**l cI>NA(SEQro No. 5) labeled wiA^ 

Technologies). Hybridization «r*d waA curtditfons went at 65° C as described in Qfcoxh sod 
Gilbert JVi*. AfaL A«k£ tf£4 47; 1991-1995 09*4), Fnnrivdyhybrki^ 
purified ard rescaeened wiA 

1 cDNA. Lambda phage DN A wis isolated from three phafe that hybridized to the 5* ft^gjnent 
using the Wizard Lambda Preps DNA Purification System (Frooega)* RcArictxniuiynnl 
hybridization to the 5* mai&rra^ 

12 
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im gene, while atLuJ ififly 
from both type* of ptugc 



mot pBhncript vector ftar 



Numeraa tinuaftinnariaQ vector! tie ffvaileMe for plant I 
pronttcre nd chmcne pm af thtf favcaftioo cm be need io 4 
The selection of vector for we will d tptud upon fee pntend I 
target ipccicf for umrfofi iMtCTi For ce rtain tarect aptc ica, dtfficicot J 

uIaaAbJW ^M^^^aW^^^A ^^h^u* B^^fc jm^ak^J^^fc^^^J Cm^^^AAk^^A d^^^^^aV^^^n ^B^^^dl ■ 

a ciciuuu hmxhi may dc ^jiuuiul jfcHJLuuu nawaeiv m I 
(he upci? gene wUeb oonfeft ifiiatanf 

79; 259-268 (19X2* Bern* at. Mi** 30#/lS^l87 (J9fl3)). tbei^feae wb^ aw*m 

Spencer *f <xi Theor Appi Genet 79: 625^631(1990)), flat hph gear wfaidi codoi icxina*x id 
the antibiotic hygromycin (Blocbinger ft DiggehnarawMrf C*U Biol 4: 2929-2931 X tod tbedtf' 
gene, which confers resistance to methotrexate (Bouroaii tt aL, £MB0 / 2(7): 1099-110* 
(J983))- 




(1) Coostrucdon of Vcctm Suitabk tixA&i*ba£$aiim*T tmMftAuuii 0& 
Many vectors are available for trasfonnaDon using A^robocteriwn mmcfaciau. These typically 
20 carry at least one T-DNA bontoseqaeaxi^ 

Acuir /to. (1984)) and pXYZ. Below the coostructkxi of two typical vector* is described. 



Tbe binary vecton pQB200 and pOB2001 arc used for the 
25 vcciox* for use witfe Agrvbacterivm and was constructed in the fbttowing 

was created by Sari digestk* of pTJS75 (Srhmkfhanicr ft Ifcliiafku/ Aaeaviot. 

13 
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(19*5)) allowing mririon of the aMracyclnw reiwtiact p^Mk^by arm* of mhtd 

rrafowmirompUC4Kc*a^ 

aL,»at*rt 304; 114- JJ7 (1983); McBride *JL, PLmtMtkeularBioloi? 14: (1990)). 

Xk>/listitmwmligi«cdtoche£eo^ 

DNAboidHif Apltntietectabtefm^ 

Cm 5J: 153-161 (1987)), mi the XtoAcfipitod fr»*mft mi ctarf »S***fa—d 
pT!S75k«n to ci«aiepCIB200(icc alio EP 0332 104, example 19 [1338]X pdBMO minim 
die following itniquepotyMrcteftrk^ £co/Z/> Sitf, Ipn/. BfW, Xbal, w*tS*£l. 
/>OB2<W/ttaderWitw^ 

aotftionaJicsaictioo siftes. Unkpjcratriax* irtri m wvEcoKI, SstI, 

Kpnl, BgUl Xbal. Sail. Mhd } BcU. A wii Apal, Hpel. and Sdtf. pGBZttl, is addition id 

right T-DNA borders fix AgrobacteTiMm*WD&xcA trattsformxiom, tbc IUC2Hfcrn*d trf/i toacben 
for mobilizjukiD btrwocn £ ooli tod other bom, and tbc Orfl" tod OrfV fu n ctio n s abo from RIC2- 
TbtpCIB2001 potylinfccr is suitable for the cloning of pinot exprnsoD < 
owaregitiatty signals* 



Ccnanwtign of pCOMP and HYrfgrnycin Sdnttion DmYWiwtfaciwf 

Tk» frngry wtnr f*~TR 1ft *nftain« ■ *fwr«4mg Ir wmy rin reiiitanrc for ^Ifrtirw m 

20 plants. T-DNA right and left border sequences and incoipualca sequences from the wide hort- 
rttgepUsmkipRK252»Ikjwing^ I* 
contraction is o^acribedty 153-161 (1987), Various demntivesof 

pCPlQ have been constructed wbkfa inaxporite tbc gene for bygromyco B pbua^Minrgfemc 
described t*Grtarfci.<^ 25: 1W 

23 transgenic plant ceSs on hygronrycia only (pOB743), or b ygcanycin nod tauyuyun (pOB7l5> 
pCB717). 
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(2) Comtmctkn of Vectoo Suitable far rw*-4, 

TwufuittMtiOM without 4)6 mb o( A^fobocttHt^t 
mpnitmut for T-DNA tommoti to 
ucunf iTKte joqucqom on or oanxeo id iwiiifW id 
5 M&cctnmmT-IMAwcquctM, TtPMfa M *i oo 
Atnbaatrim* include t/ansfuwtiop via particle 
doo) nd microinjection. Tbc choice of 
i for tbe apocicv beijuj uauafuiiDBiL Dokiw, As 
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rely «■ 

pretopltft spake ICC and 



tfymxal 



CnnimirtiiMofpaB^ 

pCIB306* is a pUC-derivcd vector suitable for direct ftt 
anibination wim aeJccto 
comprises the CiXfV 355 promoter in opentioaat fusion to me £ c*U CUS fcne and the C*MV 
3SS trraiaTptiOfttlteniu^^ 

The 35S promoter of (to vector contains two ATO i m^^ Tbeaeste* 
were mutual using sbmdard PCR trrhnVpirt in such * way m to remove the ATGt a*d fm*c 
me restriction sites SspImdPvuII. TbemwnMtkfo§toiMe96Md3?tymfMyhomtoe 
unique Satf site and 101 and 42 bp away from the aemal starting Theftsumnt derivMiwof 
pCIB246 was drsignairri pdB3Q25. The OUS gene was men exdjed from pQB3Q25 by 
digestion with 5^ *nd Soothe lenmni na^^ 
pQB3O60. The plasn^jJITK was obtained fra 
bp&nq/ftmgnicittccotiinm^ 

and inserted iito the /(ptftitetf (1M7)). 
Thii geAetited whidi oompM 

prosper tod tcnnmiiorta 
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■ uiU fc k for ftc ckwim af ptawt uqjuiim m w— u«« ii i — » itp»l«oiy 
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Coftaimctkm of OSQQ19 

pSOQ35iia 
(DHFR)asa 

the 3SS promoter (-M0 bp), town 6 from 



vector which ndlne Che E. eolf | 

B imrtMawn a t PCR waawadtoi 
Adhl few (-530 bp) *d It bp of d* <R» 



ipSOGlO* A2J0bp£r^cmerta>cafaf *e£,c0ff 
dihyckciblate reductase type H «cw alao notified by POtiad dm two I 
wereaseeml^withe^-Jrftaira^ (C3gi^)wl^««pri»d ftepUCtt 

vector bacJo^ aid the Awen*rycrf*c«fiMjac^|eoen*Dd 

pSOC19 which coDOim 35S proiDotar in 

the DHFR gene Mid the nopals Replwmertof teGUS k**er * 

pSCKJ19withtbekac*T*Qi^^ 

vector pSOG35. pSOG19 and pSOG35 carry thepUCgettefor an*fc^it»*anoe«dh«ve 
Hbuail SpU. Psd and EcoEl site* available far the cknzag of fdrdgn sentences J 
gene sequences containing a plant protox promoter* 



EXAMPLE 12: 
50 Codwg sequence* nin^ 

protox promoter may be wsembted in exprtai^ 

upstream of a suitable transcription tenninatoc. The nanhing chimed je«» can n^ bee 
transferred to the plant mufcTattm vcO« 



25 grntpx Prompter Selection 

In tceetdantt with the present invention, the 
Tbe selection of the specific protox promoter 



wiDocmaapiantproaox 
uaed in die chimeric fen* k priority up 
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a ptant species cto*ebriclA*d to, efin^ 

the lesuhxag chimeric fette. ForexAnpftc.if ttoctawkfcw tsmtcnoWso^ 
mAize plant It would to pttfenble id ok a pretax proroaser from a ittooocofyk*teiium pins ned 
S most pctfof Able to mo a i 




A vAriety of traoAcripticQAl tomiwiart a* AVAilsbte for 
At responsib le for the tcnniiiiiioo of trBflAcnpcioo beyond the 
10 potywfaiylrtka. Appicpte 

in plots nod toclode the CaMV 355 fenmamr, the An* 
tenninstor, the peA /*cS 69 tennnutor. as wtQ is tnmituiart OAtnriDy 
protox gene (ix< *pvoco9C tstnuxiAton")* These cao to nod in both 
dicotyledons* 

15 

ftairncra for to FJihttccmffl or Rwliaw tf Brnfai^ 

Numerous sequence* hm been found to co hip c e gene expression fitm within the 

tnoscr^Akmal unU And itoAc •equoic^ 

inven&oo to increase their expression in ftnifB 
30 Various intxua 8cqueoce« tovebec& shown to i 

inonocotytodopoas ccDt> focxAmp^ tteintroos of tbemAizc Aifci \ 

significantly enhAnce the expression of the wild-type j 

imnxiuctd into maize cells. Imroc 1 was found to be particularly effective And calmed 

exprosskm in fiukm constructs 
25 Genes Develop. 1: 1200 (1987)). la the same e xp e rimental sygctn, toe inm» from the 

mate bronze J gene had asaailig effect in enhancing expression (Calljser 11^*)- Introo 

icqocooes davc been routinely iuuorpcffitri into plant ttipafonnsboo ytuoo, typtofly within the 

noo-tr»nsl«ed leAder. 



found to 
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romper oi iwo-trisauaw 
enhance cxpiLiiiofl, and tfwe ere particularly effective in dkocytodottu* cefl*. ScrofcaOy, 
leader aequcnces from Tobacco Mosaic Vina (TMV, (he n¥-eequef»~X Maize Chtorotic Mottle 
Virus (MCMV), *ftd AHalfa Mouk Vinis (AMV) have been shown to be effective in entering 
e*pmak«i (e. f . Qdbe * oi MrL Addr Jlr a J)' 8695-871 1 ( 1987); Staazeaki *t ol Mjwt Jfofcc. 
Bid 75:65-79(1990)) 



fyir* w r * f fir flHW Pmftm VMim thr fdl 

Various rnrriiaiiraius for targeting gene ptomicfs are known tp cult to p lants and 
10 sequences controlling (be functioning of these mechanisms hive been dnncferi 

For example, the targeting of gene products to the chloroplaat is cootroDcd by a signal sequence 

found at the amino tfcamukal end of various protests and which at cawed during liikMOptaat 

import yielding the mature protein (ex Coroai ctaL J. AM. Oat*. 265; 15104-15109 (19BS)> 

These sigiul sequences can be fuse^ 
IS heterologous products into the cfcloroplast (van den Broeck«r4, Natvt 3l3\ 358-363 (1985)). 

DNA encoding for appropriate signal sequences can be isolated firm the 5* end of the cDNA* 

ettccding the RUBISCX) protein 

and many other protemiwnic^ 

Other gene products are localized to other or^gajid 
20 peroxisome (*.g. Unger ct aL Plata Mofcc. Biol 13: 41 Ml* (1989)). The cDNAs encoding 

these products can also be nunirwilatrrl to effect the targeting of hctcrotogous gene products to 

these organelles. rkajfttpte of sudi sequences an 

aspartate amino transferase boforms for mitochondria. Tarring to cellular protein bodies has 
been described by Rogers et oL t Proc NaiL Ami Set USA 82: 6512-6516(1 985)). 
25 In addition sequences have been chATactrrixrd which cause the targeting of gene produce 

to other cell c&mpartxnenis. Aniix>o terminal acquenccs arc responsible for targeting to the ER. 
the apopUst, and ejorirellul ar sccrctioo from akurone ccOs (Sochler & Ho, Plmt Cefl 2: 769- 
783(1990)). Additionally, amino terminal sequences in etM ij untaiun with carboxy 
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sequences are responsible for vacuolar targeting of gene products (Shmifai tr oL, Plant htoltc 

Bid 14: 357-368 (1990)), 

By the fusion of the appropriate targeting s e quence s described stove to tnmgeoe 

sequence* of interest h is possible to direct the tmspro product to any organelle or oeD 
5 compartment For chloroplstt tar^ting, for cjuunpfe, the chloropiast signal s ny i f nrc from die 

RUBXSCO gene, the CAB pme, the EPS? synthase gene, or the G52 gtae if food xn frame to die 

amino terminal ATC of the tmnsgene The tignal sequence s e le cte d thootd inchrti the known 

cleavage the and the fusion comrnrtcd should take into account any amino acids after the 

cleavage site which arc required for cleavage, In some ease* this rcqutretnej* may be fulfil 
10 the Kidition of a inudl numbered 

or atenstively replacement of some amino acids within the tnragrae s tqucecc . Poskm 

coosmrad for chtoroplastmrpart can be tested to 

translation of m vitro transcribed tti^^ 

15 Motoilf Btoloyv Hifcvicr. an 1 Oft U 1091 ri9R2V. WmnmttaL HoL Gen. GemeL 446- 
453 (1986)). These construction techniques ire well known in the art and are equally applicable 
to mitochondria and peroxisomes. The choice of targeting which may be requited for expression 
of the tranigeryy will depend on the cclhilar^^ 
point for a given pathway. This wiU usually becytorofcorcfaloigpiastic,^ 

20 esses be mitochondrial or peroxisomal. The products of transfeae expression wiU ** normaty 
require targeting to the ER, the apoplast or the vacuole. 

The sbovc described mechanisms for ceUular targeting can be utilized in eoojuaoica with - 
plant precox promoters so as to effect a specific cell tempting goal under the transcrqttonal 
regulation of a promoter which his sn expression pacem different to that of the promoter from 

75 which the targeting signal derives, 

EXAMPLE 13: TVansfonnattaarDko^ 

Transformation trriimque* for dicotyledons are welt known in the srt and include 
A^rtrfwctenum-based techniques and trrfmiqiiet which do not require A jf iebaae ri mt. Noo- 
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Afrobo eurimt i nchi a jw involve the cplakc of eiop u io pi yn con inaecnai dbcgly by 
protoplasts or cells, This caw be accotnplabed by PBO or ekuiupuatioo mediated opttfce. 
panicle b uuj b miium t-iDcdiated de&very* or anatnQfictnL piamplmaf tbeactechmgocsarc 
described by Fsskowfti et a£. BUBO J S: 2717-2722 (19*4), Petrykns « 4, MM Gem. Start 
199: 169-177 (19&5), Roch rfal, Ooadbiotop* 1001-1004 (1996). end Kkmitat Mn»t 
527; 70-73 (1937). L* thetratrfomiedcelli wiepAmrtttwtetepkWQnv 

stendwd trehnkyes known in the eit- 

AgwaVflctciiawHnediieed fiimfuniMdon it i jj i rfmtd I ft^fug pe tor trMuf^jmjo o/ 
dicotyledons becew of to bj^ 

diffeitnt species, The txmny crop s pecies which sic i outinf l y tr s mfta roabi t by A^r mf m rte r^aan; 
iadudetofccc^tamaio^sin^ aod poplar (EP 

0 317 51 1 (coooo), EPO 249432 (tomrto, to Calgcoc), WO 87^7299 (Afujji^ toOJ^ae), 
US4,795.SS5<popUr)). 

Tiantfomatiea erf the target pta 
CMiMvation of the Agr*^ 
known k) the aft. Transformed time is jcgencmed on i 
or herbicide tesistaiioeinartep^ 



EXAMPLE 14; 

Tmrfditniikm of moa monococyledoc species his now iko become roptine. Preferred 
technique* indwfc direct gene transfer into protoplasts wag PEC or t kuio po rari oo techniques, 
and particle bombardment into callus tissue. TrattsforniatioGscattbe 
DNA species or nulDpk DNA species (u, co-tmsfonnation) sod bath these techniques nt 
suitable for oie with this inveiitioft, CoKranif onnstion may have the advantage of avoiding 
complex vector construction and of generating transgenic plants with unlinked loci for the gene of 
interest and the selectable marlcer*, enabling the lerncval of the selecsabte marker in subsequent 
geneiaDOftt* should this be regarded desirabk. Howtttt.aeasadvwtageof theuseofcor 
M^oimarion is the less than 100% fte^neacy with whfchaepafnte DNA sp 
into the genome (Schoeber et aL Biouchnoiogy 4; 1093-1096 (1986)). 



hut Application* EP 0 292 435 (to Ob»Gefe), H» 0 398 22S (to GbfrOcj&XWO 
93^7278 (to C3b*-Ocij7) wd ILS. Pmt No. (to QboOdgy) deterib» acfaaiqpt* far 

ibc prcpenooo callus and prv^o pl a m from id £Sfc iiibujj too of maoe, (iaivfiaaie£OQ erf 
protoplasts using PEC or cleumptfabon, and the mgcoenikni of maize placis from u audia mcd 
protoplasts. CoidOD-K«Dtm er<tf. t Pfewr Cell 2: 603418 (1990)) tad From a aL> 
Bitmdmoto&& 833^ (1990)) have poWisb^ 

maoc tine using particle bombardment I\tnlmmm^ a p pfwri oo WO 9307278 (to CSbfrQeigy) 

ad Kaael * al, Biotechnology 11; 194-200 (1993)) fkacjitetedn^^ 

or cuk moral unci maiic cy panic g tMiamumtuL iius aw wiMpr roinri auiiiwiiic mam 

embryos oft 5~2Jom lengthen 

lOOOHc Biottstm device for bofotandmeoc 

Transformer ion of rice cm also he malLrtafaa by diwet jbdo twMfcf mchnicflea, uifliBag 
pratoptasta or particle bowbtiiiiucDL ^n^nplaff^^rrijalM irinlfrr fimiHw ^^i tfff* 4nrfifrr^ fir 
Jfporuco-types and /ndtaHypes tfZantet oL* Plant CtURzp 7\ 379-3*4 (1988); Shimtmnm «r 
*t Muure lift 274-277 (1989); Dana el at Bioudmob& &• 736*740 (1990)), Bath types sm 
alio routinely transformable using particle bonbetdncm (Chritai t? oi Biotdmolpfy 9; 957* 
962(1991)). 

Patent Application EP 0332 581 (to CSba-OeigY) describe* techniques for the ^aeration, 
uansformariofl and mgenrntion of Pnuidrse proeoplasta. These technique! sDow die 
transformation of DoayUj and wheal r urt faei m ort. wheat tntfitfofmaooo was beeo deathbed by 
Vatil et at, Bioitchtology 1 CL- 667*674 ( 1 992)) using particle bombardment into cells of type C 
long-term tegcaerebfe callus, and also by Vasi) <r at, Bioudmoioty 11; 1553-1558 (1993)) and 
Weeks el aL> Plani PhyjiU 102: 1OTM084 0993) using iwrtkie 

embryos and immature embryo-derived callus. A ptUurui technique for wheat transformation 
however, involves the transformation of wheat by particle bombardment of inmamt embryos and 
includes cither a high sueroec or a high mahoae step prior to gene delivery. Prior to 
borabardmefiu say number of embryos (0.75- 1 n*ninksgih)ampiate4c*iol4Snvd^ 
3% ucyw (Mumbife * Skoog. 

f«- induction of tomatic embryos which is allowed to proceed in the dark. On the cornea day of 
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(U.iutiv^fi»dfciuw& 

15%). ItecibbiyoimiDd^teptMD^ T^ay 

S (agfa* pOB3064cfpSG3S)k^ 

procataret. E^ptate<rf«biyoftt$botita 

pci*ssrcof-1000pijQsms*itsndirf Altar tefpbardaem, Aeontayei Me 
pt^b^brtotte^torw**fori^ Altar 24 ho. the 

10 ct>y for ihoot jl imnnh before uy wnttoo. Appraufltttoly ct 
with dmlof^ abtayofeae catta 
HAA, 5 o«/Uto GAX fimteceottb^ 

ofpOB3064Mrf2u^iM*batttxa^ After ■ppraxin»te»y 

< developed ihooo Bt t xuafemd to Ufjar itmle oaatkzoen known m °GA7t a wfak± 
15 tudtonxgthMS^ 



White the ptctentiavaifrrt to fa^ 

potti^iadftcccRlx^tfw^ 

tt being Witfun the spirit *nd icopc of the pnscoi invention. 



'mmumHmm s k b 



(1) APPLICANT: Nt^lU, Marl* A. 

Volrath, Sandra X*. 
mxA, trie A, 

(ID title of xmarrxoMt pkomotm ntow plabt wowwmiWB 

QTTDAS* cmns 



CUD Monn or snmrcu 

(lv) 



13 



; dba-Osloy Corpora tioa / Patonc Dapt- 



(A) — _. „ 

(B) STOW?: $40 Khita Plain* ft*. 
<C) CITY: ltrrytoa* 

(0) g TATE; JCY 
(S) CUUHfRf : USA 
(PI MP: 1O5H-9O05 



(v) 

(A) idsomc rm: rioppy disk 

(B) COMPOTE*: IBM PC coapatlbia 

(O omAton ffrsm: pc-dgs/ks-dos 

«» SOFtlORZ: PBtantlft Ml«iN #1.0, Waraloi #1.25 

<vi) CURRENT APPLICATXON &MPA: 

<AI APPLICATION MONftER: OS TBA 
fB) nUSC DATE: 
t€) CLASSZrZCATZONi 

(Vli) PRIOR APPLICATION MA 

(A) APPLICATION NQMHOts US OB/261,158 

(B) PILXIC DATE; l*-Jt&HM 



(A) HWIt: Ela*r, M Acotv 

(B) REGISTRATION OTMBtfll 36,129 

(C> RHFEAB9C£/D0CnT ROBBER: COC 18467proV 

(IX) TSLBCCMMOKZCATI0I9 XMFQJQIATXCM: 
(A) TELEPHONE ; 919-541-8614 
<B) TELEFAX: 919~541-06B9 



U) IKPORKATIOir POR StQ ID MO;l: 

(1) CWkBACTEBISTXCS: 

(A) LSWTH: 171» bass pairs 
IB) TYPE; nuelaic acid 

(C) snuanasDHCSSi iiaala 

(D) TOPTOOCY; liMar 
(11) KOLXCULB TYPE; cEKA 

(111) HYPOTHETICAL: HQ 
(W) BVrX-SDiSB; MO 



.«L C • ■ t- -t ffc «_« *— « * 



_* 
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fix) 



U) nvr/isTi cm 

II) LOOVnOM: 31..1M4 

id) otMBi taratmrzoB: 

froa bMBC-2' 



(Xl) BWBOMK* BnCKmoM; 119 IS BOili 



tqacaaaatt ccoaat*ctc looanrcc ato oao tta tct err crc 

Mvt Olu Lw Mr Lot 

1 5 



COT CCO 



aat etc 



40 



ACQ ACT CJU TCO CTT CTT COO TCO TTT TOO AM 
Tar Thr Ola Mr L*u Lou fro Mr Pao Mr Lyi pro m 
10 15 30 

AAT OT* TAT AAG CCT CTT AOA «C COT TOT f» « OCT 
A*n V&l Tyr LyiMolAUAxvUuJ^CyiMr^l Ala Oly 

35 30 35 

ACCGTCOGATCTTCAAAAATCaAAOaCO0AOaA0QCA«ACC 

nr val oly Mr Mr tys xio Olu oly Oly oly oly ftr ttor 
45 SO 

AOS OAT TOT 0TC ATT GTC OQC OCA OPT ATT MOT OPT CTT TOC 
Thr Asp Cya V*l Il« Val Oly dy Qly Xla *«r Oly Lsu cy« 
60 « *• 

CMS GCO CTT OCT ACT AAC CAT CCT OAT OCT OCT COB AAt TTA 
OlAAl4lMAl«ttarLy«Ki0fveiA«pja«Al«r9mA«nLMi 

75 10 i« 

ACC OAG OCT AAG GAT COT OTT OCA OQC AAC ATT ATC ACT COT 
Ttor Clu Aim Ly» Asp A*o Val oly Oly Am Xls 3l« Tax Ar* Olu olu 
90 95 100 



aiy 



55 



ATC 
Xl« Al* 



lla Val 



AAT OCT TTT CTC TOG GAA GAA OOT COC AAT AOT TTT CAA COO 

Asa oly Pa* l^u Trp olu Glu oly Pro Asa Mr lfes Ola rro 

105 HO US 

CCT ATO CTC ACT ATG CTC OTA GAT A0t OCT TTC AAC CAT OAT 
Px* Mt Lau Thr Mat Val Val Afp Mr Gly I*u Itft Asp A*P 

12S 130 



TTC OOA GAT CCT ACT OCC CCA AOG TTT OTB TTC TOO AAT 
Lau Oly Asp Pro Tte Ala Pro Arg Ft* Val Lau Trp Asa Oly 
140 145 ISO 

AOG CCO GTT CCA TOG AAG CTA ACA GAC TTA CO* TTC TTT GAT 
Arg Pro Val Pro Mr Lya bau Tar Asp L*u Pro Faa »»* Asp 

155 K0 1« 



AOt ATT OCT OGG AAC ATT ASA OCT OOT TTT GO? OA CTT 
Mr xl« Cly Oly Lys Xl* Are Ala Oly Pfe* Gly Ala L*Q Gly 
170 175 HO 

CCG TGA CCT CCA OOT COT GAA GAA TCT OTO G*0 OA© TTT OTA 
Pro Mr Pre Pro Oly Arg Olu Olu Mr Val Olu Olu Fta Val 
165 IPO US 

AAC CTC OCT OAT GAG CTT TTT GAO COC CIO ATT OAA CCO TTT 
Asa Lau Oly Asp Olu Val Fat Olu Arg b*u Xla Olu Pra Pa* 

14 



TCT GAT 
U0 

TTQ OK 
iisu val 

1« 

AAA TTC 
Ly* L*u 



TTO ATO 
tsi HK 



ATT CCA 
Xl* Axy 



OQ8 COT 
Arg Arg 
200 

TOT TCA 
cy« Smt 



ioa 



ISO 



1M 



244 



294 



U2 



3M 



431 



4M 



S«2 



«0 



471 




90S 



09t 

Oly 



oly 



ACT 
Thr 



OM 
MP 
205 

AM 
Lyo 

AAA 

Lys 



Oly 



CMS 
Gin 



etc 

Ln 
345 

TXT 

Tyr 



ATC 
XI* 



Uu 



TCC 



CfTT TUT 

vol Tyr 



am} orr 
ty> vol 

TTT AM 

Pho Lor* 

250 

coo coc 
Pro org 



ooa err 

Oly Lou 



art am 

Vml Lya 



OGA TAC 

oly Tyr 
315 

AOC AAA 
Sor Lyo 

330 

TTO COC 
L*u Ary 



Alo Oly 
220 



Ttp I*» 



G*T CCT TCA 
Aop Fro Oar 



CT»QMCM 
Lou Olu oift 
240 



310 

CTO 



315 



225 



OOf OOA 

Oly Oly 



■ot Lyo Alo Alo Pho 

230 

AOC ATA ATA 0»f OOP 
Oor Ilo *lo Oly Oly 
245 



OCA ATT 
A3* Zlo 



CM OM A00 AAA AM 



OOA AT© 
Ar* ft*t 
214 

TTO TCT 
In Mr 
300 

MC TTA 



AM 
Am 
425 

<XJt 
Oly 



TAC CCA 
Tyr Pw 



CGA ACA 
Aro Thr 



CAT CCA 
Hi* Pro 

TCA CTC 

sor t«u 

410 

TAC ATT 
Tyr II* 



GM TTA 

Glu 



AOP OTT 

Sor v*l 



CCT CTT 
Pro Lou 



CCA OTT 
Pro vol 
345 

GAA TOT 
Glo cy» 
380 

COC MB 
ATO Thr 



TTT CCA 
Pho Pro 



QOC GOG 
Gly Oly 



OTO GAA 
Val Ola 
445 



Ola ow Ar* 
255 

AAA OCA CM 

Lyo pro Ola 
370 

fro oca oaa 

pro Olu 



tOQ AM CTC 
Trp Lyo 



ACA TAT QM 
Tar Tyr Olu 
320 

OTA ATO ACS 
Vol W*t Thr 
335 

TCTOU TCT 
fior Olv Sot 
350 

OCA OCA OTA 
Alt Ala Vol 



TTO ATA OAT 
Lw U« Aap 



CAA OGA CTT 
Gla Gly vol 
400 

AAT COC OCA 
Aon Arg Alo 
415 

TCT ACA AM 
Sar Thr Aon 
430 

OCA OTT OM 
AlO Vol AOp 



WO 



Pro Lyo Aio alo oof 
340 



Oly 



tCA 

305 



Thr 



ore 
vol 



OCT 



TCt 



GOT 

Qly 
385 

GAA 

Glu 



COG 



AOC 
Thr 



CAA ACA 

Ola Thr 

27* 

ATA TCT 
llo fox 
2f0 

OOT ATC 
Oly Zlo 



CCA OAT 
Pro AOp 

CCA TCT 



OTT <Mft TCT 

vol oly oor 



2O0 



Alo Aro Lou oly 

205 



ACT AM CtO OM 
Thr Lyo l-au Olo Oor 

310 

OO* TTA OPT TCC OTO 
Oly Lou vml *** Vol 

325 



OCA AAT 
AlO Asn 

355 

ATC ICS 
Zlo 

370 

OAA CTA 

Olu 



CAT OPT 

U±m Vol Ala 0or Oly 
340 

OCA CTC TCA AAA CTA 
AlO Lou oor **o Loo 

300 



An 



ACA TTA 
Thr LOU 



COC O0A 
Pro Oly 



OGA ATT 
Oly Zlo 
435 

GAT TTO 
AOP Lau 
450 



TAC CCO AAA OAA OCA 
Tyr Pro Lyo olu Alo 
375 

AAO OCT TTT O0O CAA 

Lyo Gly Fba Gly Gin 
3*0 

OGA ACT ATC TAC AOC 

Gly Thr Zlo Tyr Sox 
405 

AfiA ATT TTO CTO TTO 
Ary ZIO Lou 1m Lou 
420 

CTO TCC AM TCT GAA 
ox l&m Sax olu 
440 



AOS AAA ATO CTA JUT 
Aro Lyo HOC LOU Zlo 
455 



AM CCT AAT TOO ACC GAT OCA CTT AAA TTA OOA OTT AOQ OTA TOO OCT 
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410 445 410 



CJAoccAWccTCiWTTfCTAcnTOCffa«wra*Tj*ft:err 

OU AU XI* fro OlA Itw 140 V*l Oly Hi» Ph* A#0 H* Lau A+p Thr 
479 440 445 



OCT AAA TCX TCT CTA ACQ TCT TCO O0C TttT QAA MO CTA TTT T*J 

All Lyi Mr Hr uu Hir ftr Hr oly tyr oiu oly Uii u« oly 
4*0 495 SOP 



gac aat tac ore act «tt <m ooc rtk oac coo wr or* oaa 

Gly Mn Tyr VAl AU Oly Val Ala Im Oly Am Cy» V*l em Oly Ala 
305 910 S15 536 

Tyr frlu nr Ala ll« Olu VbI A«n Am tta Mtt «w ACf lyr AU tyc 

525 530 515 

AAA TAAAT9TAAA ACATTAAATC TCOCAKTtO O9IQI0TTT? 
Lya 



14f4 



1943 



1W0 



1434 



•v. 

Is 



rTQAOATATC CAAAAAAAAA AAAAAAAA 
(2) ZXrORNATXOtf FQft OQ IB K:2t 



(A) LOOID: *37 

(B) TYFV: amino 4Cid 
(P) TOPOWGT: llmtr 

(111 M0UBC0L1 Tttts protaln 

(xi) SIGODCS XSSOimCRi AO ID HO**: 

tt*t Glu Lau S« Uta Lou Arg Fro Iter thr Ola M 
15 10 



15 



fa* su lyt Pro am ia*i Ar* w Am Vkl Tyr lys Fro Arg l« 

20 25 30 

Art Cya Sar Val Ala Gly Gly Pro Thr Val oly Sar Jar Ly» Xla Qlu 
35 40 45 

Qly Cly Gly Sly Tax Thr Xl« Thr Thr Aap cya val Ha Val Cly Cly 
50 55 40 

Gly Il« sax Gly l^u Cy» Il» Ala Ola Alt I#u Ala Thr X*** Bla Pro 
«S 70 IS BO 

An Ui XU Pto Aan Uu U« Tbr Clu Ala Ly* Aap Am Val Oly 
45 50 »5 

Oly AM Il« 21t Thr Arg Clu Olu Am Cly Pfea fcw Trp Olu Glu Oly 
100 105 U0 

ProAMSarPhaGlaProteAMProllattMThraatValvalAap 

115 120 125 

Sar Gly LauLygAapA*pI*uV*lLau Oly Aap Pro Thr Ala Pro Arg 

130 135 140 

25 
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Pea val Lau Ttp Mo oly Lyo 

14S 130 



AraProvolrraoarLygLoaTlar 



its 



Xla Oly Oly iy* XI* Arg Ala 

170 173 



Oly ft* Oly Ala Lau Oly 11* A** Pro Oar fro Pro Oly Art Olu OU 
1*0 ioo 1*0 

tar Vol Qlu Olu fba Vol Arg Arg AM Lou Cly A** Oly W MN Ai 

ItS 200 204 

Arg Uu xla Qlu tro rtoo cyo Mr oly vsl tyr Ala Oly A*p fro o*r 

310 215 330 



l^LouaaraatbyoAlAAlAlaaQlyttfgVilTrpLyo 

33$ 230 33ft 



Ola 01s 
340 



Oly Oly 



XI* XI* oly oly Thr Ph* iff* Ala XI* OIa ola Aw 

345 250 25* 



Uf An Aim Pro Ly* JLU Olu Arg A*» Pro Arg Mo Pro Cyo Pr* Ola 

2i0 305 370 



Oly Ola Thr Vol Oly Oar Poo Am I** Oly Lou Arg Not Un Pro Ola 
215 200 205 

Ala lia for Al* Arg Lau Oly Mr Lyo Val Lyo Lag aar Trp Ly* Lao 
200 205 300 

£ar Oly Xlo tax lyv Lau Olu Oar Gly Oly Tyr Aaa Lau Ybr Tyr Ola 

305 310 315 320 



val val 

ValPr*teHi*V*lAloflaxGlyLML«iArgProI*tt 



Thr Pro Asp Gly Lau val aar val ola aar Lyo 
325 330 



340 



345 



350 



335 
Olu 



Ala Aaa Ala Lou Sar Ly* iiou Tyr Tyr pro Pro Vol Ala Ha val 

355 300 345 

Sar Ila Sar Tyr Pro Ly* Olu Ala Zla Arg Thr Olu Cya Lau Xla Aop 

370 375 340 

Gly Olu Lau Lyo Gly Pb* Gly Ola Lou Bi» Pro Arg Thr Ola Oly Val 
3 OS 390 305 400 



Olu Thr Lou Oly Thx XI* jyr Oar Sar Sar Lou Pba Pro 
405 410 



Arg Ala 
415 



Pro Pro Oly Arg XI* tau Lau Lou Aaa Tyr Xla Gly Gly Sax Thr 
420 425 430 

Thr Gly XI* Lou Sar Ly* Sar Glu Oly Olu Lau Vol Glu Ala Val 
435 440 445 



Arg Aop Lau Arg Lyo Not Lau XI* Lya Pro 
450 455 



aar Ttar 
4C0 



Pro L*u 



Lyo Lau Oly Val Arg Val Trp Pro Ola Ala Xla fro Ola Ro Lou Vol 
4€S 470 473 400 
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as 



39 



Oly Mio ft» A*» XU fcvu Aft? 

405 

oly Tyr olu Qly Im ffco 

500 



Lyo Mr tar Ui Ttv tar Mr 
490 4*9 



Oly Gly Aon Tyr vu au oly v»i AU 

505 510 



Lou Oly A*f Cyi V*l Olu Oly Mi tyr Olu tfcr AU XU Olu «■! am 

513 MO Uft 

AmitMfl«t9*rAcoTyrAl*iyri*i 

550 535 



12) vKfxmancm worn «*Q zd 

(i) 



iiii 



U) LmoniT l79t taM poirt 

(■) TTW; fiuelolc *cl4 

IC) mAMCt&Hifa^ olaou 

CD) TQfQLOOTr llaw 



(11) MPUKOL1 TYVIi 

dill irrfomrzcAL; w 

tlv) ANTX-S0H: BO 



tlx) 



(A) NMB/VIT: CDS 

IB) LOCATTOB: 7Q..1554 

(D) OTHER ZNPtiaiaTZW: /net— «Arafe*dflfMtU 
fro- prfDC-l- 



3 



Wl ' HiAUJ AA ' HCCOICA CffX TT KJUA CtOOIGAGM ATtTTGACVC 



CAOATAOCA ATC OCO TCT GOA OCA OTA OCA OAT CAT CAA ATT 

ttftt AU fi«r Oly AU V»l AU A«p Hi* 01* XI* Olu AU 
15 10 

OTT TCA 6GA AAA NBA GtC OCA OIC OCA OOP OCA OOF OTA AOT OQA CTT 
VaI s*r Gly Ly» Arg VmX AU val Val Oly AU Gly V*l 3er Oly 1m 
15 20 25 

GOG GCG OCT TAC AAO TIG AAA TOG AOG GOT VtG AAT OfC ACT OXO TTT 
AUAUAltiyrlyiUui^iSirAxv Oly l*u An vtltbrVil A« 
30 55 40 45 

OAA OCT GAT OOA AAA OXA 09? OOG AAO TTO AGA AOT OTT ATO CAA AAT 
Olu Al« A*p Oly ax? Mml Oly Oly Ly» L+u Ax* Iter Vftl IM Oln Ab 
50 55 40 



OOT TTG An TOO OAT OAA OOA OCA AAC ACC ASG ACf OAO QCf (AO 
Gly Uu XI* Ttp Asp Olu Gly AU AJB Tttr wc ttor olu AU GU Pro 
(5 70 75 

OAA OTT OOO AOT TEA CTT GAT GAT CTT OdO CTT COT GAG AAA CAA CAA 
Olu Val Oly S« LM Un M» lo Uu Oly Z*u Arg Glu £*» Oln OLo 
•0 «5 K 



40 
100 

154 

204 

252 

500 

340 
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fta fTO 
§5 

CM ATO CTA 
Val pot Liu 

110 

TCT ACC CAA 

Mr Thr olo 



AAA AAO TCC 

14* Lyo for 



0*0 TTC fTT 
01\1 fDO fDO 
ICO 

OAC OCT TTT 
AOp fTO FN 

175 

AT© AAO CAT 
ftot Ly* His 

ifo 

TCf ATT ATA 
S*r Xlo Ilo 



AAA MOT ACA 

X*» sir Arg 



GOG TCA TTC 
Gly 9mr Pho 
240 

TOC AAA ACT 

cyo tor* s«r 

255 

TCT TTC TCT 
So* Lou SOT 

270 

TCT CTT TOG 
Cyo val Oor 



OCA ATT ATC 

VAl Zl* MOt 



AAA GGA OOA 

Lyo aly Gly 
320 

ATO CCC CTC 
Mt ProUu 



TCA CAO 
•vr Ola 



OCT ACC 
fro Thr 



TCT AAO 
tor l<yf 
130 

TCA AAA 
Bor Lyo 

145 

caa cat 
qId Arg 



val Oly 



TCT TTC 

sor fao 



CTC OCT 

val aly 

210 

GAC ACA 

Asp Thr 

225 

TCT T^T 
$mt Ffco 



CTC TCA 
Lou Sor 



TAC AAT 
Tyr A«& 



CAT AAT 
Hi* An 
290 

ACC OCT 
Thr AU 
30S 

CAA CCC 
Gin fro 



TCQ (TTT 
Sor Vol 



AAT 

Aon 

115 

TTT 
ffco 



ate 

Vol 



CAT 



MO 000 
Ly* Ar« 

100 

CCC ATA 

fro xlo 



CAA ATC 

aiA llo 



TAT MT 910 000 
Tyr xlo vol Ar| 

OAO CRI 9fC ACA 
Olu Low Vol Thr 
120 



AJVf OOT QM WV 
am aly ^1 fro 



300 



Uu lm Olu fro 

135 



tCA OAT OCA TCT QCt CAA 
Mr Aop AU Oor Alo olu 

150 



JOT HOT 9IO CTC 
for oor Val loo 

125 



TTT TO TOO 
fnO boo Trp 
140 

QAA MT 0» 
Ola oor vol 

155 



aly 



CCA 
Pro 
155 



AAC 

Lyo 



TTT OOA 
Wtm Oly 
145 

ACA Mf 
Thr Oor 
140 

GAT CTC 
Asp Lou 



ATC AOA 
Xlo Arp 

JUST TCT 



CAA OAO OTT OTf 
OlA Olu vol vol 



OCT 



oteiMCfeifte 
Aop Tyr boo xVo 

170 



1*» 



CAT 
His 



TCT 
£or 

275 

GAA 

Glu 



CCT 
Pro 



Pbo 



TTA 



009 OOA 

Gly Gly 
245 

OAT GAG 
A#p 01U 

260 

OCA TCA 
Gly Oor 



TOO AAT 
Trp Aoo 



ACA AAG 

Ttar lyo 
215 

OCT OOC 

fro Gly 
230 

ATO CAG 
Not Gift 



Aop fro 
105 

OTA GAG 
Vol Olu 

200 

TTT OCT 



J*o 

OCT AAA 

Alt Lyo 



ATC AAT 
Xlo AOO 



ACA CAG 
Aro GlO 



ACC CAG 
9%r Olo 



CIO TOC 

Lou cyo 



CAG CTA 

Gin Uu 
325 

ATC ACC 
XlO TUT 



AOA CAA 
Arg Gin 

AAT GTC 
AOA VOl 

310 

AAC TTT 



ACA AAA 

Tfer l*o 



att err 

Ho LOU 



TTA OAC 
Lou A*P 
215 

GAG AAC 
GlU AMD 

280 

AAC CCC 
Asa Pro 



AAC GAG 

Ly* Glu 



CTC CCC 
Lou Pro 



AAA OCT 

Lyo Gly 
215 

CCT OAT 
Pro Aop 

250 

TCC AAG 
1*0 



TOO TCA 

Trp Box 



CAT TAT 
oio Tyr 



ATO AAA 
NOt ItfO 

315 

GAG ATT 
Glu llo 

330 



!*» Gly 
205 

oor oor 
Gly Oly 
220 

TCC CDT 
Oct ATO 



AGO no 



QTA CTC 

vol 



Lou Sox 
245 

GAT OCT 
Aop Alo 

300 

OTT ATO 

Vol On 



AAT TAC 

Jum Tyr 



4f2 



54# 



490 



ACA TTC ACA AAG 
Thr fbo Ttar Ly» 
20 



olu Lyo Vju lyO 



732 



700 



021 



474 



424 



972 



1020 



1000 



1U4 
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335 



940 



94$ 



*9 



3 



AOAccTCTTOAAOQcrrtoooffrACicATTccA 

Ar« Jxo MU Olu Oly PM Oly val Ub U *r© 
330 395 3« 

CATOOTTTCAAAACTCTAOOrAGACTTTrrTCA 

Kii oly fim Lyv Thr mu oly Thr Mu Pho far 

370 375 

OUT 09f TCC OCT AOT CMC OTT CAT CIA AGA 
MP Ai« kr tro Nr Vil RU 1m lyr Thr 
3i5 330 

M7rAaOAA6GA0aAACf»0CCAaAGCrTOCACT 
Mr Arg Mm Ola olu Mm Ala Ly* AM Mr Tter 
400 405 



1CV MOM 
Mr Lytt Olu Ola Vf 
345 

TCA ATfl AT0 TTT CCA 
Mr vk Itot N ho 
340 

ACT TTT AfT Off* 000 
Thr Km XM oly Oly 
995 



ertavoACTtCTOACCTrcMooicTorn) 
Val vkl Thr Mr Asp Mu Qln Arg km Ma Oly 
415 420 

0T0 7CT OTC A*C CAT TAC TAT TOO AOS AAA OCA 

val Mr Val Asa KM Tyr Tyr Trp Are lyo Ala 
430 435 440 

AOC AOC TAT OAC TCA OIC ATS QAA OCA ATT 0*C 
Mr Mr Tyr Asp Mr Val MtC Olu Ala Zla Asp 
450 455 

CTA OCT GOO TTC TIC TAT OCA GOT AAT CAT OQA 

MU fro Oly Vbm Ft* Tyr Ala oly Ana Bis Arg 
445 470 

OOG AAA TCA A*A OCA TCA GOT TOC AAA OCA OCT 
Oly Lyt Mr Ilo Ala Mr Oly Cy* Lya Ala AM 
400 445 

TAC CTG GAG TCT TOC TCA AAT OAC AAC AAA CCA 
Tyr Mu Olu Sir Cyi Str Aan Asp Lyv tors Pro 
455 500 

AAOOTTCOTC OCTTTTTAK ACTTACTTTO TAAACTTOtA 

CQATTAOCCA ACAACTCA0C AAAAO C CaflA TTC1CA9AAO 

ACTATTTATO TAAAA 



imp olu Mu Vf* 01* 
410 

QTt OAA OCT OAA CCC 
Val Olv Oly Olu Kro 
425 



tfaa Pro Mu Tyr Asp 
445 

AAO ATO QAO AAT OAT 
Lya Mac olu 



006 000 CTC TCT OTT 
Oly Oly mu Mr val 
475 



11*4 



1212 



X2M 



13M 



1354 



1404 



1452 



1500 



OAC CTT GtO JSC «CA 1541 
Asp Im Val Zla aax 
4S0 

AAT OAC AOC TZA TA&CATTOTC 1403 
Asn Aap Mr Mu 

505 

AAATOCAACA A jJirauU/ltt 1C43 

1723" 
1730 



(2) ZKFORMATZ0N FOR StQ ZD P0:4: 

(i) SBQGBKB CKARACmiSTZCS; 

<A) LENGTH: 500 amino acl4a 
W 1YFV: acid 
(D) TOPOLQGT: 11MOX 

<ii> sCLKULS TTtli proMM 

Cxi) ssQomes zwwimtx: bsq zd moiiz 

Ml Ala S#r Oly Ala Val Ala Aap ttis Gill Zla Olu 
15 10 



Val Mr Gly 
15 . 
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Aff 


Val Ala 


val Val oly Ala 


Oly 


val far Oly 


lat Ala 

30 


Ala 


Ala 


*yr 


iya 


t«u Lya 


*or Ar* Oly Lau 
40 




Val Thr val 


Fha Ola 
05 


Ala 




oiy 


Arg 

50 


val oly 


Oly Lya Im Ar* 


Mr 


val Mat Ola 
CO 


as* oly 


Lao Ila 


Trp 




Olu oiy 


Ala A«a Thr Mat 


tt 


olu Ala oly 
75 


Pro olu 


Vod 


«iy 

80 


Oar 


L«U 


Uu AO 


Aap Lau Oly Lau 
15 


*rg 


Olu Lya Olft 
50 


am n* 


Pro Zla 
05 


a«r 


Ol& 


Lya Lya 

100 


Arv TVT Xla Val 


10? 


AM Oly Val 


pro Val 

1 1 A 






fro 


Ttar 


A*n 1TO 

115 


Ila olu Loo val 
120 


Thr 


0«r Oar val 


125 




ola 


Mr 


iv» 

130 


Fha Oln 


Ha Ua Lou Olu 

135 


Pro 


pha Lao t*p 
140 


im 


Lya 




sar 
145 


Lya 


Val Sax 


Aap Ala 8ar Ala 

150 


Olu 


Olu flor Val 

155 


aor olu 




100 




At© 


Hii Pha 


Gly oln Clu val 

165 


val 


Mp Tyr Lao 

17Q 


Ila Aap 


Pro Pha 

175 


val 


oly 


Oly Thr 
190 


Oar Ala Al* Aap 


Fro 

185 


A*p Sor Lau 


Sax Mat 

lto 


Lya 


BlO 


Sar 


Fha 


Prt Asp 


Lw Trp A*n val 

200 


olu 


Lya 0«r Fha 


Oly «ax 

205 


ila 


Xla 


Val 


oly 
210 


XI* IU 


Ar? Thr LyA Fha 

215 


Ala 


Ala Lya Oly 

220 


Gly Lya 




Arg 


Aap 

225 


Thr 


Lya Sar 


Sax Pro Oly Thr 

230 


Lya 


Lya Oly sor 

235 


Arg oly 




240 


s«r 




Lya Cly 


Oly Mac Gin Ila 


LOU 


Pro Aap Thr 

ISO 


Lau cya 


Lya 

265 




LflU 


Sar 


Hit Aap 


OlU Zla AM Lau 


Aap 


Sax Lya Val 


Lav *ax 

270 


Lau 




Tyr 


Aan 


sar Gly 

i75 


S«r Ary Gift Olu 
200 


Aan 


Trp s*r Lau 


S+T Cy» 
285 


val 


8ar 


His 


Attn 

290 


Clu Thr 


G1& Art Oln Asa 

295 


Pro 


Uia tyr Aap 

300 


Ala Val 


Xla 


Hat 


TV**- 
3QS 


Ala 


Pro 


Gya An vol Lya 

310 


Ol* 


315 


Hat Lya 


oly Cly 
120 


fii« 


Pro 


Fb* QlA 


Uq An pha Lau 

325 


Pro 


Olu zla aaa 

330 


Tyr Hat 


Pro 
335 


Lau 


*ar 


val 


uu xia 

340 


Thr Thr fha Thr 


Lya 

345 


Glu Lya val 


Lya Arfl 

350 


Pro 


Lau 



31 
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Olu Oly 


355 


Oly V*J Uu XX* 


Pro tar 1^1 Ola cOfi 
3fO 


ly» lio Oly 

345 


PM 




Ztft 7fer 
370 


X*U 


Oly Thr l«u pf* 

375 


tar Mr mt Jtot »b« 
310 


Pro Atp Ac^ 


Mr 




Pro Mr 
305 


A»P 


Vul Hl# L«u Tyr 
350 


mr Ar n* 11* oly 
305 


Oly Mr Are; 


Atn 
400 


H 


Oln Olu 




Alt Ly» Al* Mr 
405 


T*r A*p Qlu l«a iy* 
410 


Ola Vml Vul 
415 


Tte 


L"-s 


Mr a*p 


L«V 


Gin Ary L«u L*a 
430 


oly vtl olu oly Olu 
425 


Pro VaI Mr 
430 


Vftl 


■a ■ 


Am Hi* 


Tyr 
435 


Tyr Tip Arg Ly» 


AU ib* Pro l«u vyr 
440 


Asp Mr Mr 
445 


Tyr 


E v% ' * 


A*p Mr 

450 


V*l 


lMt Olu Ala Xla 
455 


A«p X*» Mtc Olu A*n 
440 


Asp fcgu fro Oly 




fi» rb* 

445 


^yr 


Alt ttly Am Kit 
470 


Ary Oly Oly Um Mr 
475 


Vftl Oly Ly» 


400 


if!*'- 


Xl« Al* 


Mr 


Oly Cy» Ly« Aim 
485 


Al* Asp L*4 V*l lit 
490 


Mz tyr Uu 


Olu 




St Cym 


Mr 


Ac aid Xiy» l4yu 


Pro Aso Acp Mr L*u 







SO0 505 

(3) XUPQMOKXCM PO* SJQ ID l»;5: 

(i) SSQOBUCS CHARACTBWST1CT; 

(A) LENGTH; 169B twM p»±ru 
ffi) TTPt: aucloic acid 
|C) STRAKOTIKESS: tingle 
(D) TOPOLOGY: linear 

(11) MOLKDXiB IWs ctttA 

(lit) HYPOTHTTTCAL: MO 

(iV> AUTt^SQtSB; HO 



<ix> rWMVRti 

(A> MANE/KIT: CDS 
<B) LOCATION 2.. 1453 

(D) 0TKTR INFORMATION /hot*- •ItuiM protean 1 
full-longtfc) ; Mqunc* froui pMDC~4» 



<Xi) seOOIHCK MSCKXFTXW: SBQ IP 110:5: 

O AAT TOO OCO Owe TOC OTC 0T0 9TC ODC OOA GOC ATC AOT OQC CTC 
AM Mr Alt A*p Cym Vtl Val V*l Oly Oly Oly XI* Mr Oly Lou 
IS 10 15 

TOC ACC OCO CAC OCG CTC OCC AOS COO CAC OOC OK OOS GAC OTO CTT 
Cy« Tter AlA Gin Al* hmi Alt Ttir Ar« Kia Oly VAl Oly Aap ViU. Lai 
20 25 30 



4« 



54 
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OK ACQ 
Vml Thr 



OQ? coc 
Arg Pro 



ccc TW 
Pro sor 

gac no 

A9P Lou 

$0 

000 AM 
Gly Ly« 



Ottf CTC 
A*P Lou 



OOC ATC 

oiy xu 



OT© OOC 

vai Arg 
145 

'l'J-1— TCC 

Pho Cy* 

160 

OCT OCA 



ATT GOT 

lit Gly 

cca cce 

Pro Pro 



tct we 
Smr Pho 
225 

TO GOT 

Uu oly 

fa GAT 
•or Aop 



orr too 
Vol Mr 



om aoc 

OlU AU 

OM «A 

GlU Olu 

CMC CCC 

Arp tro 



GTT TTT 



ere wo 

l*u Ar? 



ATO aoc 
tot MT 

115 

ooc ceo 
Arg Pro 
130 

OOC AM 
Arg Am 



TCA GOT 
Sor Gly 



ITT 000 

9t>» oly 



00A ACC 

oly 'ftr 
195 

AOG OAf 

Arg Asp 

210 

AO0 AM 

Arg Ly« 



aot aaa 

Mr LyO 



0AC AAC 

Ao? ly» 



OTO CM 

v*i Gin 

275 



ooc ooc ooc 

Arg AU Arg 



GOO TAC CTC 

Oly Tyr i*u 



Off CTC ACC 
vol Lou Thr 

70 

GOG GAC CCA 
Oly Aop Pro 

as 

CCC <7TO CCA 
Pro vol Pro 
100 

atc oca no 
Urn Pro Oly 



CCT OCT CCA 
Pro Pro Pro 



CTC OOT OCT 
Utt Oly AU 
150 

GTC TAT OCT 

v*l Tyr AU 

165 

AM QTT TOO 
Ly« Vol Trp 
180 

ATC AM ACA 
llo Ly» Thr 



OOC OOC CTT 
AU Arg Imi 



OOT CTT OCC 
Oly Lou AlA 
230 

OTC AAA CTA 
Vol Lya 



C0C OK 
Pro Oly 
40 

TOO 0*0 
Trp Olu 
55 

ATO OCC 



25 ^ 



AAC 000 



TAT OTT 

oly tyr Vol 

OCT AAA ACT 
Ala Ly» Mr 



TCC AAO 

for Ly» 



AAO CTC 
Lya Lou 

uo 

OOC OOC 

Oly Arg 

335 

OM OK 
Olu Vol 



OOT OAT 
Oly A*p 



cog no 

Arg LOU 



AST CM 
Zl« Oln 
200 

CCC AM 
Pro Lyo 

215 

ATC CTT 
Hot Lou 



TCA WC 
SOT Tip 



TTO QMS 
Uu Olu 



OTT ATC 
Vol llo 
390 



OM OOT 
Olu Oly 



OTQ GAC 
Vtl A«p 



COO COT 
Pro Arg 
90 

CCC OOC 
Pro Ala 

105 

AOG OCC 

Arg AU 



GAA OM 

Olu GlU 



TTT OM 
Pfe* Olu 



CCT TCP 
Pro so* 

170 

GAA GAA 

Olu Olu 
155 

GM AGO 
Olu Arg 



CCA AAA 
Pro Ly* 



CCA AAT 
Pro Am 



AAA CTC 
Ly* Leu 
250 

TAT OAA 
Tyr Olu 
315 

ATO ACT 
Mac Thr 



ATT ACC 
II Thr 



CCC AAC 
Pro An 
50 

AOC OGA 
9or Oly 

75 

TTC OTO 
Pfc* Vol 



OAC CtC 
A4p lou 



GOT CTA 
Oly Lou 



TCA gto 
Sor VAl 
140 



etc 
Arg liou 

155 

AM CTC 
Ly» Lou 



ACT GOA 
Thr Gly 



AOC AM 
Sor lyo 



O00 CM 
Gly Ola 
230 

occ An 

AU Uo 

MS 

AGO AOC 
Thr Sor 



ACC GTC 0M 
Thr vol Olu 
45 

AOC TTC CM 

vox Pbo ou 



CIO AM OAT 
x*u Lyg A*p 



C9G TOO GM 
imi Trp Olu 
15 

CCC TTC ^TC 
Pro Vmm Pho 

110 



Thr Pro 



ATT CCA 

Zl« Pro 



CTT 
Oly AU Lou 
125 

OM OM TIC 
Olu GlU Pho 



ATT GM OCT 
XI* Olu Pro 



AOC ATO AM 
sor Hat Lya 

175 

GOT AOT AT? 

oly sor zio 
190 

AAT CCA AAA 
Aon Pro Kya 
205 

ACA GTT OCA 
Thr V*l AU 



ACA TCC AOC 
Thr Sor Sor 



ATT ACA AAA 

llo Thr Lya 

255 

OAA GOO OTT 
Olu Oly Vol 
370 

TCA TJUT OTT 

sox Tyr vol 
255 



142 



190 



259 



395 



394 



430 



479 



S2€ 



574 



422 



570 



719 



744 



914 



942 
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■t 



■ * 



290 



29) 



105 



310 



330 



325 



340 



355 



370 



375 



385 



390 



400 



405 



420 



435 



450 



455 



445 



470 



TCA 
Mmr 


AOC OAT OCt OCA OA* OCT CTA 
Sat A»p Alt AlA Al* bW 

300 


910 


OCT 
AU 


OCT <7X3t ICT OTT TOO IJff OCA 

Al* va! tfar v*l s«r tyr >tp 

315 


550 


ft* 
L*u 


att out oaa oaa ere c*o oac 

11* A#P Oly Olu un Ola Oly 
330 333 


1004 


CAA 

oia 


qoa an OM aca n% oqa ao 
oly vai alu tte lm aiy fbr 
345 350 


LW9A 


AAT 

An 

340 


gm mm Oflflh flHl 
VVi OCT \A#T UDL UwT MM W1V 

Axy AlA 1*Q A#p Oly Ary VaI 
3C5 


1102 


OCT 
All 


ACA AAC AGA OOA JUT tZTT Tw 

Tto An Thr Oly ii« VAl Mr 
300 


m uv 


OCA 
AU 


OTT OaC 00T OAC CTC G9A AAA 
Vfcl A*p Arg A*p X4W Arg Ly* 
395 


1191 


OAC 
MP 


ccr m ore czt oot or coa 

Pro L*U VaI IMX Oly VAl Ary 
410 415 


1244 


TTC 


*mwm*i Mft /vu am* mm n&M ^^f*W 

CTI* OTA CAT V* * CTT 

X*m VAl Oly Vi* X-ftu A#p Lm 
425 430 




OAC 
Aap 
440 


CQA 00T 00C TAC OaT 000 CTC 
Arg oly Oly Tyr A*p oly Lm 


134) 


OQA 

aiy 


OTT OCC CTQ OQC HSK tOC OTT 
VaI AIa Mtt oly Arg Cy» VAl 
440 


1390 


CAA 
01» 


ATA TCT OAC TTC TTC ACC AAO 
II* Mr A»rp Fb* Imk Thr Ly» 
475 


1434 



TAT OCC TAC AAO TQATOAAAGA AOTOOAOOOC TACTTOTTAA TCOTTtATOf 1490 

Tyr aIa Tyr Ly- 
480 

TOCATAOAtO A0Q79OCTCC GGOGAAAAAA AAOCTtOXaT AOTATTtTTT A'l'lCJTATTT 1550 

TOTAAAtnC ATTTCTOttC TTTTTTCTAT CACTTAAtTAC TTATATTTtA OTTCTOTAOQ lflD 

AOATTortCT (TRCACtOCC CTTCAAAAOa AAtTTTATTT tTCATtCTTT TAtOAOAflCT 1470 

CmiCTACTTA AAAAAAAMI AAAAAAAA 1490 

(2) umnATioir rw «0 » moi4: 
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■3* 



so* 



•V 1 ' 
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iffRdt 
(A) LMRVs 403 a*±n© at 
(I) TOi audi* add 



TTPIi prottltl 



11} 
(ii) 

Mr Ma cyi v«l Vil Oly oly oly 1U Mr ny &w Cyi 

IS 10 15 

20 25 30 

Tte Olu Ala Arg Ala Arg Pro Oly Oly AM Xlo tKr ttr Ml Olu A*g 
IS 40 45 

fro olu Olu Oly IVr l-uu Trp Ola OXu oly fro mb Mr PM Ola P» 
50 55 «• 

fS 70 7S 10 

!MV^fiHGlyJ^tnAiAAl«rroAr9taVftli«iTxpaia0ly 
•S 90 9ft 

Ly< x^u Arg ftt) Vil Kro 8«r I^i fro JO* A*p L«*i Pro ffa* Pba A*p 

^00 105 wo 

Lva Mt Mr Zli Pro Gly Lyu Lau Mff AlA Oly L*U Oly AX» Mtt Oly 

1X5 UO 1^5 

Xlo Arg Pro Pro Pro Pro oly Arg Olu Olu tar voi olu Ola* PM Vol 
150 135 140 

Arg Arg An Lau Oly Ala Olu Vol rte Olu Avg Lau Xla Olu Pro Jfea 
145 150 15S 1*0 

CyoMrOlyVulTyrAAaGlyAJpl«MrI^Lauio*Mu^EAla 

1C5 1^0 ITS 

Alt, Ffc* Gly &ya Vml Trp Arg Liu Olu Glu Hur Oly Oly S«r Xlo Xlo 
100 IBS 190 

Gly Gly The Xlo Ly» Thr X2« Gin alu Ax-g S«r Lya Am Pro Ly* Pro 

195 200 205 

Pro Arg Aop Ala Arg L«u Pro Ly» Pro Lyo Gly Glfl thr Val AlA Mr 

210 215 220 



PM 

225 


AXV 




Oly LuU Ala Mt 

230 


t«u Pro 


Jutn AU xlo 

235 


ttor Mr Mr ban 
240 


oly 




Ly» 


val Lya Lau Sax 
245 


trp L/s 


Lmi tte Mr 
250 


Xla Iter ItfO Mr 
255 


A» 


Aap 


I** 


Oly Tyr val Lau 
240 


Olu tyr 

2€5 


Olu Tfer Pro 


Olu Oly VaX val 
270 


Mr 


val 


Qle 

275 


Ala Ly» Mr val 


Ua UK 

240 


iter Ila Pro 


«ax Tyr vol Al* 

ass 
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"3 



A* 



ato 




3*5 




300 








Al* 

*05 


iyr Tyrf pro 


Pi* V*l 
310 


Al* Ala V*l *hr 

313 


V*l 


Ovr Tyr 


fr* 


ty« 

330 


Olu Al# 


xl* Arg iff* 01* cyw 


im ii* am oir oi* 

330 




oly 

333 




Oly 01» 


LM His fw Arg tor 
340 


ola Oly Vfcl oi« 
343 




m oly 

350 


in* 


XI* 




339 




340 




345 






370 


km Tyr Xl* 


oly oly 
375 


AUIbr Mlkr 


Oly 

310 


XI* v*i 


•or 




Ar elu 
39$ 




Vfcl 01* 
3*0 


3*5 






Iff* 


4M 


Uu II* 


am Mr Ttar 
40S 


Al* Vfcl 


An too L*u V*l 
410 


LM 


oly Vfcl 


Art 

415 


Vfcl 


Ttp FM Ola Al* Xl* 

oao 


frv 01* 


*» b*u V&l Oly 
435 


ml* 


430 






Gltt Al* 


AU Iff* Al* 

435 


Al* hmi 


am A** Oly oly 

440 


tyr 


mi> oly 

445 






L#u Oly Oly An Tyr 
450 


V*l xl* 
455 


Oly Vll Al* L*u 


oly Ar* cy« 
440 


Vfcl 


An 


Oly AU 
445 


tyr Glu Smr 


Al* Mr 

470 


Gin XI* Mr am 
475 




ban Ifcr Ly» Tyr 
4M 



ai* tyr iff* 
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(1) 



ffXOM K* SIQ XD »:7: 



aiuu«Bumc8! 

(A) LBK7TB: 2041 b*** **!»■ 

(B) TTPX: zntel«lc k14 

(C) 5mAHZ«MSSS: *liigl# 
<D) TOPOLOGY: lio**r 



(11) 
(ill) 
(lv) 

lix) 



UfM/fWTXGAL: *0 

nsi( » 



(A) 

(A) LOCAT I0 W: 44, ,1430 
(D) OflOI IVFOMATXCM: /l 



Bt*» 'lliu 
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*4i 



•V- 



•V- 



CAA ATO CTC 

I 

TAT OOC CAC 
Tyr AW Hi* 



OCT wo 
Al* L*U 



TO 
tor 

30 



ACT 

t 



OCC TCA OCC 
Al* 



CMC ACT 

ma 



Tar Ar* 



tCA TO OCT 
Mr 9#r Ala 

10 

OOC OOC OOC 
Arg Pr* ATf 
33 



TO CAT CCT 
mt Hi* fro 

15 

CTA COT OBO OfC 

uu mi mi «i) 

JO 



CfC OCO MO 000 



Al* Oly 

35 



TO CMC 



AM AM Pr* Arf 



C0C COT OCA OCO 



TO 



ore ooc ofC 

Vftl Alt V*l 

5* 



val Oly Al* 



CTC AOA CM 
In ax? Ola 

65 

000 OQA 00A 
Alt Gly Oly 
•0 

OAA 00A OCT 

oiu Qiy Ala 



ATT GAT OAT 
Xl* Asp AM 



CAC AAG COT 

Hi* Ly* Arg 

iao 

GAT CCC ATT 
AM pro 11* 
145 

ATT OCXS TTA 
XI* Ala L*u 

160 

AAC TCT GGA 
Am tar Gly 



TTC TCT GAA 
m cy* GlU 



OCA TTT OTA 
Pr* PM Val 

210 

OCT CAT OCA 
AM fii* Al* 

325 



AOC OOtf 

•or oly 



AAG ATA 

Ly* XI* 



AA£ AOC 
AM far 

10C 

CTT OGT 
Uu Oly 
115 

TAC ATT 

Tyr XI* 



OfO 

v*i 



Oly Val 

5§ 

A*C OVA 

AM Val 

70 



000 CTC 
Oly L*u AU 



45 

000 000 T9C 



ONnCOM 
Val pa* Oiu 

75 



000 AOC Aft* TO OAfl 



Ajy 

B5 

ATO 
Mot 



Olu Oly Oly 
90 



TTT OfC 
Ft» VAl 



ACA QAA 

Thr Ola 



got 
Oly 



95 
CtO 



CTA 



CAA QAC 

am Am 



TOO CTA 
Mr Ua 



TTT WT 
Rm Pa* 



AAA CtC 
Vym v«l 

180 

COC CAC 
Arg Si* 
195 

OCT OCA 
Al* Oly 



TTC OCA 
M» Pro 



ore 
vml 



AXG 
tot 



GAA 

GlU 

165 
TCT 



TTT 
9b* 



AAA OAT 
Lym Am 

U5 



ty* 

120 

OQA 

Oly 



AAA AOC ACT 
iyisv tor 

150 



Tter 

OCA 



CCA TTT 
Pi© Pa* 



OAC GAG 

GlU Glu 



QOA AOA 

Gly Ara 

ACT OCA 
tor AU 

215 

TTO TOO 
Uu Tip 

230 



CIC 
L*U 



CAC 
Hi* 



OAA 

alu 

200 



oly 

AAT 



alu Trp Ola 

105 

CAG CM TAT 
01a 01o Tyr 



OCA CCA OGA 
Al* pro Ala 



Ull LI I TCO 

v*l L*u tor 

155 

TAC AAG AAA 

Tyr t#* Ly* 

170 

TTG APT GAO 
Lau Scr Olu 
195 

GTT OTT OAC 
V*l V«JL AM 



AT* 

110 



OCT AAC 
Pro AM 

125 

CTO aTT 
L*u 11* 
140 



TCC CAA 
■or Ola 



CCT TO 
Pro tor 



ttr Ly* 
OCT AAC 



ACT OTT 
tor V*l 



GAT OCA GAG 
Am Glu 

TTO GAA AOA 

i*u oiu Arg 

23$ 

37 



Tyr ft* 

205 

TCA CTA 
9ar UU 

220 



Urm tyr 



Wa 



ACA ACA 

Tar Arg 

175 

0OO AOC 
Gly tor 
130 



GTT 

val 



TO AST 

tor n# 



GOT TCA 
Oly tar 



10* 



15* 



304 



253 



300 



394 



453 



540 



3«A 



436 



732 



7*0 
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OTT 
Vftl 
340 


ATT 
210 


ott 

Vlftl 


25 


occ 
Alp 


ATT 
Ilo 
345 


TIO TCT 
LOU AST 


aao 


en oca oct am oot opt oca 
lou Alo a1« Ly» Oly Asp Pro 

250 255 


ST9lS 

o«s 


OtA 


AAA 
1*0 


ACA 
Ar 


AOA 
Arg 


Cat 
ttio 


OAT 
Aop 


TCA TCA 
Mr Mr 


000 

oly 


AAA AOA AGO MT AOA OGA OTQ 
^ An Ar« Aoa ATV Ar« Vol 
245 270 


5-74 


too 
Mr 


pfao 


TCA 

Mr 


TVT 

nm 

375 


CUT 
Hi. 


GOT 

Oly 


OQA ATG 
Oly Mt 


CAC 

om 
uo 


TCA CTA ATA AAT OCA CTT CAC 
lor Uu 11- AOA Al* Lou lis 
215 


934 


AAT 
Asa 


QAA 


OTT 
Vol 

a»o 


OQA 

oly 


OAT 
A*p 


OAT 
Asp 


AAT OTQ 
AM Vlftl 
3*5 


AAO 

tyo 


CTT GOT ACA OAA OTO TW TCA 

Lou oly fur olu Vol loo tor 

300 


§72 


TTO 
LOV 


SA» A 

Al* 

90S 




Thr 




GAT 
A#P 


OSA GTT 

Oly Vol 
310 


OCT 
Pro 


OCA CtA OOC A00 TOG TO ATT 
Al* Ipj Oly *ro Trp Aox Ilo 

315 


1020 


f*r 


OTT 

Vol 


GAT 
AsP 


Mr 


AM 
Lyo 


OAT 
Asp 

325 


aoc oo? 
Mr aiy 


QAC 
Aop 


AAO OOC CTT OCT AOP AAC CM 
Ly* Asp Lou Ala Mr Am Gin 

330 335 


1040 


ACC 

Tor 


rho 


OAT 
Asp 


OCT 
Al* 


OTT 
340 


ATA 
IlO 


ATG ACA 
MC Thr 


OCT 
Al* 


CCA TTG TCA AAT OTC OOO AGO 
Pro Uu **r Asa vol *rg Arg 
345 350 


1114 


AAV 

Mt 


IK* 


Mr 

* 

pm 


ACC 

Thr 

335 


AM 

*yo 


Gly 


OQA OCT 
Gly Al* 


Pro 
340 


OTT OTT CTT GAC T^T CTT OCT 
Val vol Lm Aop Pbo I-tU Pro 
355 


1144 


MIS 

«#• 


AT» 

Mt 


OUT 

Asp 

170 


TAT 

Tyr 


VIA 

Uu 


Pro 


Lmi sot 

375 


CTC 
lA 


MB oTQ ACT OCT TTT AAO AAO 

mmt vol ttar Alo Pao Lyo Lyo 
310 


1212 


OAT 
Asp 


aim 

GMT 
A*P 
3B5 


MM 

ViV 

V*l 


AA9 


AAA 


WT 
PlT> 


wlw IpAA 

Lou clu 
390 


MM 

oly 


TTT OOO OTC VfA A"^ CCT XMC 

4 l l WW w • J •* Mian » 

Fbo Oly Vol Lou Ho Pro Tyr 
395 


1250 


MO 

Lyo 
400 


GM 
Glu 


m*/« 
CUB 

Ola 


Mil 

Civ 


AAA 

Lym 


Hi* 
405 


GOT CTO 
Gly LOU 


MM 

tyft 


ACC CT"? OOG ACT CTC TTT 5™ 3C 
Thr Lou Oly Tbr Lou pte Aor 
410 415 


1300 


TCA 
Mr 


AUG 
Nftt 


AJTG 
M 


■MM 

Ft* 


CCA 

Pro 

430 


GAT 
Aop 


Arg Ala 


Pro 


Art Mjh« vt&T TTA TAT ACA 

IpAf ^BwVW V^"M ln4 * **• Ml* m%^ 

Aop A*p Gin Tyr Lou Tyr Thr 
425 430 


1355 


ACA 

thr 


TTT 
Pbo 


(TTT 
Vol 


QUO 
Gly 
435 


GGT 

Gly 


AOC 
Mr 


CAC MT 
Kit MA 


AOA 
AT0 
440 


OAT CTT OCT OGA OCT CCA ACG 
Xmp Lou Alo Oly Al* Pro Thr 
445 


1404 


TCT 
Mr 


ATT 
Ilo 


CTC 
L«u 
450 


AAA 


CM 

Gl» 


CTT 


ora acc 

VAl Thr 
455 


TCT 
Mr 


GAC CTT AAA AAA CTC TTG OOC 
Asp LOU 1*0 Lyo LOU Li«u lily 
440 


1452 


GTTA 
Vol 


GAG 
Olu 
4(3 


COS 
Gly 


CM 
Q1A 


CCA 

Pro 


ACT 

Thr 


T^T GlC 

PM vol 

470 


AA0 
!*■ 


CAT OTA TAC TOO OQA AAT OCT 
Bio Vml Tyr Trp Gly A-m Al* 
475 

* 


1500 


TIT 
Pho 


CCT 
Pro 


TO 


TAT 

tyr 


OOC 

Gly 


CAT 


GAT TAT 
Asp tyr 


AGT 

Mr 


TCT OTA TtO OAA OCT At* OM 
Aox Vol IsM OlU Aim Zlo Olu 


1540 
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400 



4M 



l*« UK Olu i** »m Ua *ro *ly Dm tyr AW Oly AM M* l*» 
500 505 510 



Mp Qly l*u fcl* v»l Qly Mr val xia Ala Mr oly Mr g» Ala AU 

$1$ 320 %n 

Aap uv Alt tie Mr iyr Uu Olu Mr ill Ar tr* Hi md M Iff 



930 

CAT VOMMlOtC WaCCTAlCC 
His 

945 



935 



940 



AMACfwrn T va c uxT-UA uactrKxrvr 
noMOTToa AUTiom aaai i icmu 
AJ AinuicT xrwroArrr tvxwxmt 




19M 
1444 

m* 

AM 
10*9 

lt#5 
3045 

aoti 



-To 



33 



(2) ZMFOMkTXCIf FOR S1Q ID H0sl: 

(1) 



(At UMDftf: 544 MlAQ acid* 
f») TYPE: olao acid 

<D> TOPOLOGY: liMa* 



(11) MOUrmil TTWs prottiA 

(jd) sw o mci z*scftxmcai.» siq xx> *>:•; 

Mt X*u All law Iter Alt Mr All te« Stt 

15 10 



Ha Pro lyr 

19 



Arg Hit Ala Mr Ala Hia lfcr Azy Arg Pro axg L#u Ary Ala Val law 

30 25 30 

Ala Nat Ala <Sly Sar Aap Aap Pro Ary Ala Ala Pro Ala Ary Mr Val 
35 40 49 

Ala Val Val Gly Ala Oly Val Mr Gly Umi Ala Ala Ala iyr Ary law 
50 55 40 



Ary Gla Mr Oly Val Aan Val Tte val Pba Olu Ala Ala 

65 70 75 



Ary Ala 



Qly Oly l*y* Ila Arfl ttor in Mr Olu Oly Qly Pha Val *xp Aap Olu 
09 90 9$ 

Oly *la A*n Tfcr Mat Tfer Olu oly Qlu *rp Olu Ala Mr Ary LaT Xla 
100 105 110 

3» 
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•-v 



■p Lya Ola din Tyr fro Aaa far 

1*0 135 

tap Arg tyr Ila Val Lya Aap Qly Ala »r» Ala KM tia taa 

no m i*o 



•t* na 

141 



150 



194 



na 

ioo 



All Lau Fba fba 


olv 
145 


fro Ppa bfll 


fyr Lya Lya Ala Aaa Tta Arg Aaa 

1*0 1^5 


Mr uy Lya ▼a* 

ioo 




Sin Alt* Mia 
WAV MA 11 H*B 


Lav 
105 


tar Gla oar 1*1 Qly oar ffta 

IfO 


f^rt fllu i««t Mkm 
vy* WV "V 

its 




Q1.V AaM filv 

300 


*U 


val Aaa tyr fta tal tap pro 

305 


p** mi ^* aiv 

m MAM 

ftlft 




tar Ala Gly 

215 


As* 


Pro Oltt Oar baa aar tla Arg 

220 


■IP JUA VPS «V 

aas 


Ml 


*MU T*P Mi 

230 




Gin Arg Lya tyr Gly aar tal 
335 140 


Xla Val oly Ala 


Xla 
345 


UuMriyf 


Lau 


Ala Ala Ly> Gly Aap Pro vol 

250 355 


Ly» tbr Arg *ia 

210 




tar A*r Olv 
^mr ht Wif 


Lys Arg An Aan'Arg Arg vol tar 

245 3*90 


At IM IteHil 

2?5 


oiy 


Qly tat Gla 
240 




Imu tla AM Ala tan Mm Ami 
245 


Olu vol Gly Aap 

290 


MP 


Aaa val Lva 

MM* »»* "V* 

305 


law 


Gly tbr fllu Val Lau tar Lau 
300 


AltCyiThrlM 

305 




filv Uml Wn 
vaa r*w 

310 


Ala 


lmj Oiy Arp T»p aar Xla tar 

315 MO 


Val Aap tar Lya 


Aap 


tar Gly tap 


**• 


Aap lama Ala Aar Ami Gla Iter 
330 335 


rba Aap Ala Vui 

>40 


tla 


tat tbr Alt 


fru 

145 


Imu tar ami Val Arg Arg tat 

350 


Lys Pha Tbr Ly» Gly 

355 


Gly Ala fro 
340 


val 


Val Lau Aap fba Lau fro Lya 

345 


tat JUp Tyr Uu 

3*0 


fro 


L*u tar tau 

375 


an Vtl Tte Ala Fba lya Lya Aap 
300 


Asp val tya lym 
305 


Pro 


Lau GlU Gly 
300 


Fba Gly val Lau tla Pro Tyr Lya 
355 400 


Qlu Gl* Ola Lys 


Bia 

405 


Gly Lau ty» 


tbr 


Lau Oly tbr im ffaa tar tar 
410 415 


430 


Aap 


Arg AU Pro 


425 


Aap Gin Tyr Lau tyr tbr tbr 

430 


Mm val Gly Qly 
43$ 




Ala Aaa Arg 

440 




Imu Ala Gly Ala Pro Tbr aar 
445 
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tl L*u Lyi OlA im v«l tfar Mr *«t» l*v ly» Ly« L«o Ui Oly Vftl 
450 435 440 

Oly Qly ol» Pro fte fh# V*l Ly* Kit v*l iyr T»p Oly Mn *w 
4(5 470 475 4*0 

?ro t«u Tyr Oly *ii A*p ?yr ««r tor V»l l*u Glu Al* Xlv Glu by* 
4tS 490 495 

••t Glu Ly» A*n ft* Oly Ph» Fb» Tyr AU aiy Am mt ly» Agf 

500 503 510 

Oly L«m Alt v*l aiy lit vil n« AU lir oly itr tjyt JOt AU 
515 520 525 



LiU Alt xl« «•* Tyr l*u Qlu tor *!■ ?far X*» ■!» 
530 535 540 



tar *i« 



i2) mn*mrxo« to «o x» 

CD stoma cioiMcmz«zcts 

(A) tfWWi 1811 fa*M 
(83 TTFB: micl«ic «cl4 

<a snuucnmnas: •lagi* 

<E» TOPOLOOTf liAGtr 
Uil M0UCQU TTPl: cXKA 

(lilt mrvomrzcAL: so 



(IX) flATOU: 

(A? wam/KMts Ott 
(B) tOCATXCM: 3,.15B9 

<D) Otm XHFORKATXCV: /pt«4uct- *Dblit pffDtttC-1 



(3tl) SBQOBJCS CWSCRimCM: filQ IS MO: 5: 
OC OCA ACA ATG OCC AQC OCC ACC Git GOO OOC GOO tOG CCO CTC OOC 47 

Ali ttr itee aIa ftr Al* -Rur v*l Alt XI* AU s*r Pro x*u Arg 

IS 10 IS 

OOC AQGGTCACCGQOCQCOGACACCGCGTCOQCCCGOOT TQC OCT ACC 93 
(Sly Arff V«l ifcr Oly Are Pro Hi* Are v»l Arg Pro Ary cy* Al* tte 
20 2S 30 

OOO ACC ACC GOG ACC GAG ACT COG GOG GOG COC OOC OTC OOO CTQ TCC 143 
Aim Ser s*r Al* Ttor Glu Ttir Pro Al* Al* pro Gly Val Ary tou S«r 
35 40 45 

G0CCM9GCmAttGTGGGCO0CO0CATCAGCGGCCTCT0CACCOOG 191 
Al* Glu cy» V*l Il« V*l Gly Al* oly ll« S*r Gly L#u Cym Iter Al* 
50 55 40 

CAG GOO CTO GOG ACC OGA *AC OOC OtC AOC OAC C1C CTC GTC ACC GAG 239 
Ola Alt I*u Aim Thr Arg fyr Gly v*l S*r Ajp L«u Imi Vml Tbr Olu 

65 70 75 

OC£CGOGACCOCCCOCOCGaCAACAtCACCA£CGrCQAGOgrOCCQtf 2r? 

41 
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U« Arv MP Ar« rro oiy oiy am 11 Thr To* 
•0 15 §9 

oaooootacctotoooaogooooacccaacaoc 
olu Oiy Tyr Lou Trp Olu olu oiy fro Am ft*r 

100 105 



Ml 

we 



ccGOTCCTCACCATaaccoiooACAOCooocTC 
fro Vol Lou Thr Mac Ala Val Asp tar flly low 
115 110 

TIC MO OAC COO MC 000 CCC COO WC OTO CTO 
PM Oiy Asp Pro Aan Ala Pro Arg Pho val Ub 

130 135 



AOO COO 090 COO tCO AM CCA OOC OAC CIO OCT 
Arg Pro vol fro Mr I*o Pro Oiy Aop Uv fro 
145 150 



P*» 

155 



Olu **f fro 



cec tec one 
ei* fro *■* ao? 

no 

ur qic mm 

AOO Lo* Val 

135 



QUO 000 AM CM 

Olu Oiy LyO bw 
140 

nc aoc ck mo 



AOT ATC CCT QOQ AJkO CTC AOS OCC OOC CTT OOC 
Sor zla Pro oiy x*o Loo Arg Ala Oiy tea Oiy 
1*0 145 170 

OCA CCT OCT CCA 000 C0C ttO OHO TC0 OKG GAG 
Pro Pro Pro fro Oiy Arg Glu Olu Box Vol Olu 

loo us 

AACCTCOOrOCCaAOOTCTTTOoOCOCCTCATC 
An LOU Oiy AlA Olu Vkl Pbo Olu AtO Uu Zlo 
195 300 



Olu 



OGT 0TA TAT OCT OOt GAT OCT TOO AAO CTT 

Oiy Vol Tyr Ala cly Aop fro Sor X#m Loo »ox 

210 315 

OOff AAO OTC tOO AOO TIC GAO 0A0 ATT QQA QOt 
Gly Lyi Vol Trp Arg Lou Olu Olu llo Oiy Oiy 
225 230 

ACC ATC AAO OCG ATT CAG GAT AAA 000 AAO AAC 
Thr Zlo Lyo Alo Zlo Olu AOP Lyo Gly Ly* AM 
340 345 250 

OAT CCC COA CTT COG OCA CCA AM O0A GAS AOO 
Asp Pro Ary Lou pro Ala Pro Ujfm Gly Gin Thr 
240 245 

AAO OGT CTA OCC ATG CTC CCG AAT OCC ATC OCA 
Lyv Gly Lou Al* **t Lou Pro Aaa Ala Zlo Ala 
275 200 

AAA CTC AAO CTG TCA TOG AAO CTT ACC AOC ATT 
Lyo Val Lyo teu Sox Trp Lyo Lou Thr Sor Zlo 
290 295 

CAA OOA TAT OTA TTA COT TAT 0AA ACA CCA OAA 
Olu Oiy tyr Val Lou Oiy tyr Olu Thr Pro Olu 
909 310 

CAO OCT AAA ACT OTT ATC ATG ACC ATC CCG TCA 
Gin Ala Lyi Oar Val Zlo oat Thr llo fro Sor 

320 325 330 



OlU 
ATG 



CTC W ATT CDC 
Lao Oiy Zlo An 

173 

TTT OTG CQC OOC 
fte Val Aro Arg 
190 

CCT TTC tOC TCA 
fro Ptoo Cyo tor 

205 



235 

CCC 
fro 



OTG 
Val 



TCT 
Sor 



ACA 
Thr 



OGA 
Gly 
315 

TAT 

tyr 



Lyo Ala Ala fbo 

220 

ATT ATT ODT OOA 

llo Zlo oiy oiy 



AAA CCG 
lyo fro 



OCA TCT 
Ala Sor 



A0C CTG 
Arg Lou 

365 

AAO OCG 
Lyo Ala 

300 

CTT GTT 
Lou. Val 



CCA AOO 

fro Arg 

2S5 

rrc AOG 
ft* Arg 

270 

GOT AOT 
Gly Sor 



GAC AAC 
Aop A*a 



TCA OTG 

Sor vol 



CTT OCT AO* GAf 

Val Ala Sor Aop 

335 



339 



303 



431 



479 



527 



9T4 



423 



471 



719 



747 



015 



043 



911 



959 



1007 





•y • 




• 


* 








t. jl t — 


Jk — t. 






. *• 


*• 
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' * 



atcttooqcccactttcaattoatoca — ..^ 

XX L«J^mX^***IloM»Al*AA«A^AlOI««*ri**9lw 
140 145 154 

TOT TAT COO CCA <WT OCT OCT OTA ACT CIrT TCA TAT CCA AAA AM OCT 
Tyr Tyr 9r* *r© Vil Al» AU Vil fhr Vml ta Tyr fr* OUi Al* 
355 340 145 

ATT AQA AAA QAA TOC TTA ATT OAT 090 0*0 CTC &0 OOt TIC OOC CM 
tlm Art Ly» OX» cy* b*u 21* Atp Oly OXu Mu oXsi OXy ftm Oly 01* 
370 JM 3*» 

no GAT OCA COT AOC CAA OQA OTC AMI ACT ffA OOB ACA *TA T*T A0C 
l*a Kit >*» Ar« A«r Ola Oly vkl OXu ite Lw OXy Thr XX# Tyr Mr 
303 390 395 

TCT TCT CRT TTT CCT AAT COT OCT OCT OCT OOA AflA OHO TTA CTT CTO 

400 403 410 41* 

am: tat atc oaa oor tct aca aat aca ooq atc ore w aao act cm 
Am Tyr U« OXy OXy 5«r Tfcx A*a Tfcr Oly XX* V*l mm* Mr* T*r alo 
420 42$ 430 

AG? GWC TTA OTA OOA QOC OPT OAC COT QAC CtC AOA AAA ATO TTO ATA 
&mx A*p ^Mi V*X OXy Al« V*l Asp Art A»p Xau Art; 0*t Itftt XlO 
435 440 449 

AJ^CTTAlU0CAOCAOM:CCTm0CAm000OTTCaAOIOTW» 
AAA Pro Art AX* AX* A#p fro IjM Ala Lm Oly VaX Arfl VAX Ttp Tro 
450 455 440 

CAA OCA ATA CCA CAO TTT TTC ATT OOC CAC CTT OAT OOC CTT OCT OCT 
Gin Ala Urn Pro ain Pbc l*u Ilm Qly Hi* hmp Ax* Vmi Al* Ala 
445 4?0 «?$ 



OCA AAA TCT OCA CTOOQCCAAOOCGOCTACOACOOOTTOTTC CTA 
Al* Ly* Sar Ala Uaw Oly QlA OXy OXy Tyr Amp OXy X4M fto* OXy 
480 445 490 495 

0©AAACTA£<nCO»O3AaTTGCCTTOO0CC»T0C^<M 
CXy A«n Tyr VAX AU OXy vtl aXm OXy Aro Cy« tl« Olu Oly AU 
500 505 5X0 

TAC GM? ACT OCC TCA CAA OTA TCT OAC TTC TTC ACC AAO TAT OCC TAC 
Tyr Glw Sar Ala Sw Ola Val Sax Ajpp Vfea b*u Tfer ttfi Tyr Al* tyr 
515 520 515 

AAO TOA tOSAASTWr OCATCTCTTC ATTTTCTTOC ATATACOAflG TGM3CTAOG 

ATOOCTAAAA CATCATQAGA TTCTOTAOTO TTTCTTTAAT tGAAAAAACA AAXTTTAOn* 
ATOCAATA3G T ULlVTOU: TOTAOTTCOA OCASVrAGAT C0QTA90O0A TAA A T I W1M 
TAAOCTATTC TOCAAAAPTA GTGATTTTTT T1QAAAAAAA AAAAA I AAAA AA 

(2) WMBMKTZm FOR SftQ IP N0;X0i 

<1> OURACmiSTZCS: 

(A) (MOTH: 524 «*iao *C±<J« 

43 



1*55 
110$ 
till 

im 

U4? 

1395 

1943 

13 tl 

1435 

1407 

1535 

15A3 

1439 

X4»» 
1755 
1011 
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P. 36 



w 

CP) 



» limar 



(11) 

fxl) 

Alt Thr wt 

1 

Arg v*l tte 

Mr Mr Ala 

IS 

oia cy« val 
so 

Ala iw JO* 
4s 

Aug Acp Axv 

Gly tyr i-ou 

val ii*u nr 

US 

Gly A*p Pro 

110 

Pro VmX Pro 

ICS 

Xla fro Qly 

Pro Pro pro 

Lmi Gly Ala 
195 

Val Tyr Ala 

210 

Ly« Val Tip 

225 

lis by* Ala 
Prt> Arg Lau 



TTPti 

DOCMPTZOai MO IB WOilOi 

«*rAlaftarV*lAUAlAAl«MrProtMAr*Qly 

S 10 is 

Oly Arg Pro via Arg val Art Pro Arg cyw Ala Ttor Ala 
20 25 10 

TttraluTbrProAlaAUProaiyValArvLMMrAU 
40 45 

XlO Val Qly Ala Qly XI* Mr Qly I*u Cya Af A1A Gi» 
33 40 

ArgTrrolyValacrAapLaot^DVaifteQlaAla 
70 75 00 

Pro Oly Oly am Ua Tta Ttar v»l Qlu at* Pro Aap Ql» 
«S 90 «5 



ftp Qlu Qlu Gly Pro 
100 105 



9faa Ola Pro 



Pro 



McAUValAapMrClyLauLyaAapAapLMValPlw 

120 US 

am Ala Pro Arg pfaa Val tmt Trp Olu Qly i*o 
IIS 140 



Mr Ly* Pro Qly AM Lau Pro Pba 

150 15$ 



140 



Qly Uu Ala 

275 

Val Lyo Lou 

290 



Lya bau Arc Ala Qly bai Gly Ala l^ni Qly Xla Arg Pro 

165 170 175 

Gly Arg Gin Qlu Mr Val Qlu olu ft* VH Arg Arg Am 
lao its ifo 

Olv val Fte Qlu Arg Lau Xla ala pro Pba Cya Mr Qly 
300 205 

QlyAapProMrLypLMMrMtXtfoAlAAlaPMCly 

215 220 

Arg X*u Qlu Glu Xla Gly Qly Mr Xla XI* Oly Oly Ifer 
230 235 240 

Xla Gin Aap t*a Qly Lya Am Pro Lya Pro Pro Arg MP 

24$ 250 255 

Pro Ala pro Lya Gly ala Tte Val Ala Mr Pba Arg Ly» 
2C0 245 270 

MatiaiiProfcMAlAXlaAlAMrArgUttaiyStflMi 
210 245 

Mr Trp Lya im Ite Mr Xla Ttar Iff Ala A-s> Am QXa 

295 300 
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Gly fyr V»l Ua Qly fyr 9lu itir ft« «r im av «la 

305 510 515 530 

XI Lyi lir Vtl Uf lltt Tto Ui r» str ?yr V«l Rr MP tl* 

315 330 335 



liu ktg Pro tmx 
340 



lit Mp Aap Ala Lau Mr VjfU tfea T#r 

345 350 



Tyr Fxo Pro vaI AU Ala vai ite v*l ter tyr »ro lor* oiu JOi tl« 
355 350 345 

Arg Ly» Olu Cya Lau Xla Mp Gly Glu Lau Qln Gly Mm Gly Gin La* 
370 375 300 

Eli fro Ary 5«r Sis Sly Vol Glu Tfcr Las Gly ftr Ha Tyr S*r dar 
355 390 355 400 

Sar Lau ftaa Pro Aan Arg Ala Pro Ala Oly Arg vai Uu sau l*u Aaa 
405 410 415 



31 



tyr IU Oly Gly Sar Thr Aan Tta Qly Xla Val tar Lya tte Olu aa* 
420 435 430 

Aap Lau Val oly Ala val Aap Arg Asp Lau Arg Lya Mat Utt Ila Am 
435 440 MS 



Pro Arg Ala Aim Aap Pro Lau AU Lau Oly Val Arg Val Trp tr© Ola 
450 455 460 

Ala Xla fto Gin Mm Lau Xla oly His Lau Aap Arg lav Ala Ala Ala 
455 470 475 400 

Lya Sax Ala Lau Gly Gin Gly Gly Tyr Aap Qly Lau Pha Lau Gly Gly 
485 490 495 



Aaa Tyr Val Ala Gly Val Ala Lau Gly *rg Cy* lis Glu Gly AU Tyr 

SOO 505 510 

Glu sax Ala sar Gla Val Sar Aap Pfaa Lau Ttor Lya Tyr Ala tyr Ly* 

515 530 525 



(2) nramrxov teat ssq zd ja>:ii; 



ti) 



CfMACTKRISHCSf 
(A) IBjttTB: 1047 hasa paira 
t»> TYPE: nuelsic acid 
(O sr*MOBMt*s&> singla 
(D) topology* liaaar 



(ilk 

(ill) HYPOTHETICAL: no 



(lx) FXATOH: 

(A) NAMI/m: CHS 

tm) LOCATI ON: 55 , -1413 

(D) 0XBSI XMraAKATXQJi: /product" 'aoybaan protooc-1 
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(«1 J 
HTTAOCACA 



GOT TCC 
Val Mr Vol 



ooc ccc tec 
Arg Pro Sor 

20 

AAA TTC CCT 
Lya Pbo Pro 

QMS OAA TCC 
Glu Olu £or 

50 

ON? roc ore 



OOC CTC OCC 
Ala LOU Ala 



CQA GAC C(£ 
Ary Aop Arg 

1QP 

CTC TOO GAA 
Lou Trp Glu 
115 

acc aio oxo 

Thr Mot Vol 

130 

CCT GAT CCA 
Pro Aop Ala 



CCC 000 AAG 
Pro Oly Lyw 



OMCK IP TIOM* Z9 MOilli 

TAACmoCQA ATAOTOCCAT 73CTOTAACC 



OOC AAA ATC 
Gly tyi Xlo 
180 

CCA GOT CAT 
Pro Oly fll» 
195 

oat oao <nr 

A*p OlU Vfel 
210 



TTC AAC 
Mm An 

5 

CTC CAT 
LOU KU 



OOC TCT 
Arg flor 



ACC OCO 
Ttir Ala 



<rc ore 

Vol VAl 

70 

ACC AAA 
Tbr Lyo 
05 

ore ooc 

Vol Oly 



GAA 0QC 

Glu Oly 



OTO OAC 
Val Aop 



CCT COO 
Pro Arff 

150 

CTG ACT 
Lou Thr 
1Gb 

AOG OCT 

Arg Alo 



OAO ATC CTA 
Olu III Lou 



TCC CCA ACC 
Sot Pro Thr 
25 

coc CCT AAC 
Arg Pro AMR 
40 

TCT CCO CCC 
Aar Pro Pro 
55 



TIC CCO COO 
fftM Pro Pro 

10 

TCT TTC TTC 
SOX PIM Pbo 



CCT ATT CTA 
Pro Xlo Lou 



AAA t£C A£A 
XOfo Thr Arg 
€0 



AAC CAA ACC 
AM ola Tte 
15 

ACC TCT CCC 

Ifar Sor Pro 
30 

COC IOC TCC 
Arg Cyo for 
4$ 

one tec ooc 

Aop 9ft 



AACC ATO 
Mot 

1 

CTT CTT 



OA OOC ore AOC OOC 
oiy oly oiy v»l Aor Gly 

75 



GAG OAA 
Olu OlU 



TTT GAA 
Pha Qlu 



CAC OCC AAT 
Hio Ala AID 



OOC AAC ATC 
Oly A« Xlo 
105 

CCC AAC AOC 
Pro A*n Sor 
120 

AST OCT TTA 

stray Liu 

135 

TTC ore TCO 
Pbo Vol Lou 



GAT TTG CCT 
A*P LOU Pro 
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